Vascular endothelial cell growth factor C subunit DNA is prepared by polymerase chain reaction
techniques. The DNA encodes a protein that may exist as either a heterodimer or homodimer. The
protein is a mammalian vascular endothelial cell mitogen and as such is useful for the promotion
of vascular development and repair. This unique growth factor is also useful in the promotion of
tissue repair.

BRIEF DESCRIPTION OF THE DRAWING

Figure 1. Full length amino acid residue protein translation product and its cDNA coding sequence for VEGF
AA subunit A plus polypeptide cleavage products used to determine the amino acid sequence.

Figure 2. Full length amino acid residue protein translation product and its cDNA coding sequence for VEGF
AB subunit A plus polypeptide cleavage products used to determine the amino acid sequence.

Figure 3. Full length amino acid residue protein translation product and its cDNA coding sequence for VEGF
AB subunit B plus polypeptide cleavage products used to determine the amino acid sequence.

Figure 4. Full cDNA coding sequence and full length amino acid residue protein translation product for
VEGF A 146 amino acid residue subunit, SEQ ID NOS:23 & 33.

Figure 5. Full cDNA coding sequence and full length amino acid residue protein translation product for
VEGF A 190 amino acid residue subunit, SEQ ID NOS:30 & 31.

Figure 6. Full cDNA coding sequence and full length amino acid residue protein translation product for
VEGF A 214 amino acid residue subunit, SEQ ID NOS:34 & 35.

Figure 7. Full cDNA coding sequence and full length amino acid residue protein translation product for
VEGF B 138 amino acid residue subunit, SEQ ID NOS:36 & 37.

Figure 8. Full cDNA coding sequence and full length amino acid residue protein translation product for
VEGF B 158 amino acid residue subunit, SEQ ID NOS:38 & 39.

Figure 9. Full cDNA coding sequence and full length amino acid residue protein translation product for
VEGF C 154 amino acid residue subunit, SEQ ID NOS:40 & 41.

BACKGROUND OF THE INVENTION 

A new class of cell-derived dimeric mitogens with apparently restricted specificity for vascular endothelial
cells has recently been identified and generally designated vascular endothelial growth factors (VEGFs). The
mitogen has been purified from conditioned growth media of rat glioma cells [Conn et al., Proc. Natl. Acad.
Sci. USA 87: 1323-1327 (1990)] and from conditioned growth media of bovine pituitary folliculo stellate cells
[Ferrara and Henzel, Biochem. Biophys. Res. Comm. 161: 851-858 (1989) and Gospodarowicz et al., Proc. Natl.
Acad. Sci. USA 86: 7311-7315 (1989)]. An endothelial cell growth factor isolated from mouse neuroblastoma cell
line NB41 with an unreduced molecular mass of 43-51 kDa and a reduced mass of 23-29 kDa has been described
by Levy et al., Growth Factors 2: 9-19 (1989). Connolly et al. (J. Biol. Chem. 264: 20017-20024 [1989]; J. Clin.
Invest. 84: 1470-1478 [1989]) describe a human vascular permeability factor that stimulates vascular
endothelial cells to divide in vitro and promotes the growth of new blood vessels when administered into
healing rabbit bone grafts or rat corneas. An endothelial cell growth factor has been purified from the
conditioned medium of the AtT-20 pituitary cell line by Plouet et al., EMBO Journal 8: 3801-3806 (1989). The
growth factor was characterized as a heterodimer composed of subunits with molecular mass of 23 kDa. Leung
et al. (Science 246: 1306-1309 [1989]), Keck et al. (Science 246: 1309-1312 [1989]) and Conn et al. (Proc. Natl.
Acad. Sci. USA 87: 2628-2632 [1990]) have described cDNAs which encode VEGF A, which is homologous to the A
and B chains of platelet-derived growth factor. Vascular endothelial growth factor I (VEGF I, VEGF AA) is a
homodimer with an apparent molecular mass of 46 kDa, with each subunit having an apparent molecular mass of
23 kDa. VEGF I has distinct structural similarities to platelet-derived growth factor (PDGF), a mitogen for
connective tissue cells but not vascular endothelial cells from large vessels.

OBJECTS OF THE INVENTION 

It is, accordingly, an object of the present invention to provide novel vascular endothelial growth factor C
subunit DNA free of other mammalian DNA. Another object is to provide recombinant genes capable of
expressing VEGF C subunit monomer or dimer. Another object is to provide vectors containing the DNA sequences
for VEGF A or B plus C subunits. A further object is to provide a host cell transformed with a vector containing
the DNA sequence for VEGF A or B plus C or VEGF C alone. It is also an object to provide a recombinant
process for making VEGF C subunit. Another object is to provide a novel vascular endothelial cell growth factor
which contains the C subunit. This may include heterodimers AC and BC and homodimer CC.

SUMMARY OF THE INVENTION 

Vascular endothelial cell growth factor C subunit DNA is prepared by polymerase chain reaction
techniques. The DNA encodes a protein that may exist as either a heterodimer or homodimer. The protein is a
mammalian vascular endothelial cell mitogen and as such is useful for the promotion of vascular development
and repair. This unique growth factor is also useful in the promotion of tissue repair.

DETAILED DESCRIPTION

The present invention relates to a unique vascular endothelial cell growth factor (designated VEGF),
isolated and purified from glioma cell conditioned medium, which exhibits mitogenic stimulation of vascular
endothelial cells. Glioma is defined herein as any neoplasm derived from one of the various types of cells that
form the interstitial tissue of the central nervous system including brain, spinal cord, posterior pituitary gland
and retina. Consequently, the scope of the present invention is intended to include the unique growth factor
isolated and purified from any mammalian tissue or other cells including cell lines. Cell lines include, but are
not limited to, glioma-derived cell lines such as C6, Hs 683 and GS-9L; glioblastomas such as A-172 and T98G;
neuroblastomas such as IMR-32 and SK-N-MC; neurogliomas such as H4; teratomas such as XB-2;
astrocytomas such as U-87 MG and U-373 MG; embryonal carcinomas and non-transformed glial or astrocyte
cell lines, and the human medulloblastoma line TE 671, with GS-9L and TE 671 being preferred. VEGF AB is
present and can be isolated from rat tissue including ovary, heart and kidney. Anterior pituitary tumor cell lines
such as GH3 and Hs 199 may also be used. It is intended that VEGF of this invention can be obtained from
any mammal species capable of producing VEGF. This includes, but is not limited to, rat and human.

Vascular endothelial cell growth factor may exist in various microheterogeneous forms which are isolated
from one or more of the various cells or tissues described above. Microheterogeneous forms as used herein
refer to a single gene product, that is a peptide produced from a single gene unit of DNA, which is structurally
modified at the mRNA level or following translation. Peptide and protein are used interchangeably herein. The
microheterogeneous forms will all have similar mitogenic activities. Biological activity and biologically active
are used interchangeably and are herein defined as the ability of VEGF to stimulate DNA synthesis in target
cells including vascular endothelial cells as described below which results in cell proliferation. The modifications
may take place either in vivo or during the isolation and purification process. In vivo modification results from,
but is not limited to, proteolysis, glycosylation, phosphorylation, deamidation or acetylation at the N-terminus.
Proteolysis may include exoproteolysis wherein one or more terminal amino acids are sequentially,
enzymatically cleaved to produce microheterogeneous forms which have fewer amino acids than the original gene
product. Proteolysis may also include endoproteolytic modification that results from the action of endoproteases
which cleave the peptide at specific locations within the amino acid sequence. Similar modifications can occur
during the purification process which also results in production of microheterogeneous forms. The most
common modification occurring during purification is proteolysis, which is generally held to a minimum by the use
of protease inhibitors. Under most conditions one or more microheterogeneous forms are present following
purification of native VEGFs. Native VEGFs refers to VEGF isolated and purified from cells that produce VEGFs.

Vascular endothelial cell growth factor may also exist in various alternatively spliced forms, which is defined
herein as the production of related mRNAs by differential processing of exons and introns. Exons are defined
as those parts of the DNA sequence of a eukaryotic gene that code for the final protein product. It is also
intended that the present invention includes VEGF subunits A, B and C, which are defined as comprising the full
length translation products of all alternatively spliced mRNAs made from the gene encoding the subunits and
their corresponding mature amino acid sequences generated by proteolytic removal of the amino terminal
secretory leader amino acid sequences. It is further intended that the invention only include those
microheterogeneous and alternatively spliced VEGF subunits which when in the dimeric form exhibit biological
activity such as vascular endothelial cell stimulation as discussed below.

Glioma cells such as the rat cell line GS-9L are grown to confluence in tissue culture flasks, about 175 cm²,
in a cell culture medium such as Dulbecco's Modified Eagle's Medium (DMEM) supplemented with about 10%
newborn calf serum (NCS). When the cells reach confluence the culture medium is removed, the cell layers
are washed with Ca²⁺, Mg²⁺-free phosphate buffered saline (PBS) and are removed from the flasks by treatment
with a solution of trypsin, about 0.1%, and EDTA, about 0.04%. The cells, about 1 x 10⁸, are pelleted by
centrifugation, resuspended in about 1500 ml of DMEM containing about 5% NCS and plated into a ten level cell
factory (NUNC), 6,000 cm² surface area. The cells are incubated for about 48 to about 96 hours, with 72 hours
preferred, at about 37° C in an atmosphere of about 5% CO2. Following incubation the medium is removed and
the cell factories are washed about 3 times with PBS. About 1500 ml of fresh culture medium is added containing
about a 1:2 mixture of Ham's-F12/DMEM containing about 15 mM Hepes, pH about 7.4, about 5 µg/ml insulin,
about 10 µg/ml transferrin and with or without about 1.0 mg/ml bovine serum albumin. This medium is replaced
with fresh medium after about 24 hr and collected every 48 hr thereafter. The collected conditioned medium is
filtered through Whatman #1 paper to remove cell debris and stored at about -20° C.

The GS-9L conditioned medium is thawed and brought to pH 6.0 with 1 M HCl. The initial purification step
consists of cation exchange chromatography using a variety of cation exchangers on a variety of matrices such
as CM Sephadex C-50, Pharmacia Mono S, Zetachrom SP and Polyaspartic Acid WCX (Nest Group), with CM
Sephadex C-50 (Pharmacia) being preferred. The VEGF-containing culture medium is mixed with CM
Sephadex C-50 at about 2 gm per about 20 L of the conditioned medium and stirred at low speed for about 24
hr at 4° C. The resin is allowed to settle and the excess liquid is removed. The resin slurry is packed into a
column and the remaining culture medium is removed. Unbound protein is washed from the column with 0.05
M sodium phosphate, about pH 6.0, containing 0.15 M NaCl. The VEGF AB is eluted with about 0.05 M sodium
phosphate, about pH 6.0, containing about 0.6 M NaCl.

The active fractions collected from the CM Sephadex C-50 column are further fractionated by lectin affinity
chromatography for additional purification of VEGF AB. The lectins which may bind VEGF AB include, but are
not limited to, lectins which specifically bind mannose residues such as concanavalin A and Lens culinaris
agglutinin, lectins which bind N-acetylglucosamine such as wheat germ agglutinin, lectins that bind galactose or
galactosamine and lectins which bind sialic acids, with concanavalin A (Con A) being preferred. A 0.9 cm
diameter column containing about 5 ml packed volume of Con A agarose (Vector Laboratories) is washed and
equilibrated with about 0.05 M sodium acetate, about pH 6.0, containing about 1 mM CaCl2, about 1 mM MnCl2 and
about 0.6 M NaCl. The unbound protein is washed from the column with equilibration buffer. The VEGF AB is
eluted with about 0.1 M NaCl buffer containing about 0.32 M α-methyl mannoside and about 0.28 M α-methyl
glucoside.

The VEGF AB active eluate from the Con A column is applied to a Polyaspartic Acid WCX cation exchange
high performance liquid chromatography (HPLC) column, 4.6 mm x 250 mm, pre-equilibrated in about 0.05 M
sodium phosphate buffer, pH 6.0. The column is eluted with a linear gradient of about 0 to 0.75 M NaCl in the
phosphate buffer over about 60 minutes. The flow rate is maintained at about 0.75 ml/min collecting 0.75 ml
fractions. Vascular endothelial cell growth factor AB activity is present in fractions eluting between
approximately 21.7 and 28.5 ml.
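
By way of illustration only, the elution window stated above can be converted to a nominal NaCl concentration. The sketch below assumes an ideal linear gradient that reaches the column without any delay volume, which is a simplification not stated in the specification; the function name and its parameters are hypothetical.

```python
def nacl_molarity_at_volume(volume_ml, flow_ml_per_min=0.75,
                            start_m=0.0, end_m=0.75, gradient_min=60.0):
    """Nominal NaCl molarity after `volume_ml` of a linear gradient.

    Assumption: ideal gradient, no dwell (delay) volume between pump and column.
    """
    total_ml = flow_ml_per_min * gradient_min  # 45 ml for the stated gradient
    if not 0.0 <= volume_ml <= total_ml:
        raise ValueError("volume lies outside the programmed gradient")
    return start_m + (end_m - start_m) * volume_ml / total_ml


# Elution window reported for VEGF AB on the WCX column
low = nacl_molarity_at_volume(21.7)
high = nacl_molarity_at_volume(28.5)
print(f"VEGF AB elutes at roughly {low:.3f} to {high:.3f} M NaCl")
```

Under these assumptions the activity elutes at roughly 0.36 to 0.48 M NaCl, which is in keeping with the 0.6 M NaCl step used to elute the protein from CM Sephadex C-50.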

The active fractions eluted from the polyaspartic WCX column that contain VEGF AB are pooled, adjusted
to about pH 7.0 and loaded onto a 1 x 10 cm column of Pharmacia Chelating Sepharose 6B charged with an
excess of copper chloride and equilibrated in about 0.05 M sodium phosphate, about pH 7.0, containing about
2 M NaCl and about 0.5 mM imidazole (A buffer). VEGF AB is eluted from the column with a gradient from 0-20%
B over 10 minutes, 20-35% B over 45 minutes and 35-100% B over 5 minutes at a flow rate of 0.3 ml/min, where
B buffer is 0.05 M sodium phosphate, pH 7.0, containing about 2 M NaCl and 100 mM imidazole. The active
fractions containing VEGF AB activity eluted between about 12.6 and 22.8 ml of the gradient effluent volume.

The pooled fractions containing VEGF AB activity eluted from the metal chelate column are loaded onto a
4.6 mm x 5 cm Vydac C4 reverse phase HPLC column (5 µm particle size) previously equilibrated in solvent A
[0.1% trifluoroacetic acid (TFA)]. The column is eluted with a linear gradient of about 0 to 30% solvent B over
15 minutes, 30% B for an additional 15 minutes, then 30-45% B over 22.5 minutes and finally 45-100% B over
5.5 minutes. Solvent B consists of solvent A containing 67% acetonitrile (v/v). The flow rate is maintained at
about 0.75 ml/min and fractions are collected every minute. The homogeneous VEGF AB elutes from the C4
column under these conditions at between about 32 and about 38 ml of the gradient effluent volume.
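
As an illustrative calculation only, the multi-segment reverse phase gradient described above can be evaluated at a given effluent volume. The sketch assumes that the programmed gradient reaches the column without delay (no dwell volume), which the specification does not state; all names below are hypothetical.

```python
# (duration in minutes, starting % B, ending % B) for each programmed segment
SEGMENTS = [
    (15.0, 0.0, 30.0),
    (15.0, 30.0, 30.0),   # isocratic hold at 30% B
    (22.5, 30.0, 45.0),
    (5.5, 45.0, 100.0),
]
FLOW_ML_PER_MIN = 0.75
ACETONITRILE_IN_B = 67.0  # % (v/v) acetonitrile in solvent B


def percent_b_at_volume(volume_ml):
    """Nominal % solvent B at a gradient effluent volume (assumes no dwell volume)."""
    if volume_ml < 0:
        raise ValueError("volume must be non-negative")
    minutes = volume_ml / FLOW_ML_PER_MIN
    elapsed = 0.0
    for duration, start, end in SEGMENTS:
        if minutes <= elapsed + duration:
            return start + (end - start) * (minutes - elapsed) / duration
        elapsed += duration
    raise ValueError("volume lies beyond the end of the programmed gradient")


def percent_acetonitrile_at_volume(volume_ml):
    """Nominal % (v/v) acetonitrile in the eluent at the given effluent volume."""
    return percent_b_at_volume(volume_ml) * ACETONITRILE_IN_B / 100.0


for ml in (32.0, 38.0):
    print(f"{ml:.0f} ml: {percent_b_at_volume(ml):.1f}% B, "
          f"{percent_acetonitrile_at_volume(ml):.1f}% acetonitrile")
```

On these assumptions the 32 to 38 ml elution window falls within the 30-45% B segment, corresponding to roughly 26 to 29% acetonitrile.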

Purity of the protein is determined by sodium dodecylsulfate (SDS) polyacrylamide gel electrophoresis
(PAGE) in 12.5% crosslinked gels using the technique of Laemmli, Nature 227: 680-684 (1970). The silver
stained gels show VEGF AB to consist of one band under non-reducing conditions with an approximate
apparent molecular mass of about 58,000 daltons. When a sample containing the microheterogeneous forms of
VEGF AB is separated under reducing conditions it migrates as two about 23 kilodalton (kDa) subunits. The
purification process results in VEGF AB that is essentially free of other mammalian cell products, such as
proteins. Recombinantly derived VEGF AB will also be free of mammalian cell products.

Biological activity is determined by mitogenic assay using mammalian vascular endothelial cells. Human
umbilical vein endothelial (HUVE) cells are plated on gelatin-coated dishes at a density of about 5000 cells per
well in about 500 µl of Medium 199 (M199) containing about 20% heat-inactivated fetal calf serum (FCS).
Samples to be assayed are added at the time of plating. The tissue culture plates are incubated at about 37° C for
about 12 hours and about 2 microcuries of tritiated thymidine (NEN, 20 Ci/mmol) is added per ml of assay
medium (1.0 µCi/well). The plates are incubated for a further 60 hr, the assay medium is removed and the plates
are washed with Hanks balanced salt solution containing about 20 mM Hepes, about pH 7.5, and about 0.5
mg/ml bovine serum albumin. The cells are lysed and the labelled DNA solubilized with about 200 µl of a solution
containing about 2 gm of sodium carbonate and about 400 mg sodium hydroxide in about 100 ml water. The
incorporated radioactivity was determined by liquid scintillation counting. The concentration of VEGF which
elicited a half-maximal mitogenic response in HUVE cells was approximately 2 ± 1 ng/ml. The glycosaminoglycan
heparin, which is required in these assays at a level of 10-100 µg/ml to promote a response to a positive control,
acidic fibroblast growth factor, does not enhance mitogenic stimulation of these cells by VEGF AB.
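
For orientation only, the quantities stated in the assay above can be cross-checked with simple arithmetic. The sketch assumes anhydrous sodium carbonate and a final solution volume of 100 ml, neither of which is stated in the specification; the molar masses are standard values and the names are hypothetical.

```python
NA2CO3_G_PER_MOL = 105.99  # anhydrous sodium carbonate (assumed form)
NAOH_G_PER_MOL = 40.00


def molarity(mass_g, molar_mass_g_per_mol, volume_ml):
    """Molar concentration of `mass_g` of solute dissolved to `volume_ml`."""
    return mass_g / molar_mass_g_per_mol / (volume_ml / 1000.0)


# Lysis solution: about 2 gm sodium carbonate and about 400 mg sodium hydroxide per 100 ml
carbonate_m = molarity(2.0, NA2CO3_G_PER_MOL, 100.0)
hydroxide_m = molarity(0.400, NAOH_G_PER_MOL, 100.0)

# Label: about 2 microcuries per ml of assay medium, about 500 microliters per well
label_uci_per_well = 2.0 * 500.0 / 1000.0

print(f"sodium carbonate: {carbonate_m:.2f} M")
print(f"sodium hydroxide: {hydroxide_m:.2f} M")
print(f"tritiated thymidine: {label_uci_per_well:.1f} uCi/well")
```

The label calculation reproduces the 1.0 µCi/well figure given in the text, and the lysis solution corresponds to roughly 0.19 M sodium carbonate in 0.1 M sodium hydroxide under the stated assumptions.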

A purified about 1-2 ng sample of VEGF AB is reduced in about 0.1 M Tris, about pH 9.5, with about 0.1%
EDTA, about 6 M guanidinium chloride and about 20 mM dithiothreitol for about 2 hr at about 50°C. The reduced
protein is carboxymethylated for about 1 hr at room temperature in a solution containing about 9.2 µM of
unlabelled and 2.8 µM of ¹⁴C-iodoacetic acid in about 0.7 M Tris, about pH 7.8, about 0.1% EDTA and about 6 M
guanidinium chloride. The protein is isolated after reduction and carboxymethylation by reverse phase HPLC
on a Vydac C4 column, about 4.6 mm x 5 cm. The protein subunits are loaded onto a column pre-equilibrated
with about 0.1% TFA and eluted by a 45 ml linear gradient from about 0.1% TFA to 0.1% TFA/67% acetonitrile
at a flow rate of about 0.75 ml/min. The reduced and carboxymethylated protein eluted as two peaks at
approximately 23 and 25 ml, with the proportion being approximately equal as determined by monitoring
absorbance at 210 nm.

Samples of the reduced and carboxymethylated monomers are applied to polybrene-coated glass fiber
filters and their N-terminal sequences are determined by Edman degradation in an ABI gas phase
microsequencer in conjunction with an ABI 120A on line phenylthiohydantoin analyzer following the manufacturer's
instructions. The protein showing the peak of absorbance eluting at approximately 25 ml (A subunit or monomer)
yielded an amino terminal sequence of: SEQ ID NO:1

Ala Pro Thr Thr Glu Gly Glu Gln Lys Ala His Glu Val Val

which is identical to the A chain monomers of VEGF AA, Conn et al., Proc. Natl. Acad. Sci. USA 87: 2628-2632
(1990). The peak of absorbance eluting at approximately 23 ml (B subunit or monomer) yielded an N-terminal
sequence of: SEQ ID NO:2

Ala Leu Ser Ala Gly Asn Xaa Ser Thr Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val

plus a nearly equal amount of a truncated form of the same sequence missing the first three amino acid
residues. The missing Xaa residue corresponds to an Asn residue in the cloned cDNA, see below. Since this
missing Asn occurs in a classical Asn Xxx Ser/Thr N-glycosylation sequence it is presumed to be glycosylated.
The A subunit and the total of both B subunits are recovered in nearly equal amounts, supporting the
interpretation that the two peptides combine to form an AB heterodimer in VEGF AB.
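
The glycosylation argument above can be illustrated with a short search for the Asn Xxx Ser/Thr motif in SEQ ID NO:2. This is a minimal sketch and not part of the described method; the exclusion of Pro at the middle position is the usual convention for this motif rather than something stated in the specification, and the function name is hypothetical.

```python
B_N_TERMINUS = (
    "Ala Leu Ser Ala Gly Asn Xaa Ser Thr Ser Thr Glu Met Glu Val Val "
    "Pro Phe Asn Glu Val"
).split()


def sequon_positions(residues):
    """Return 1-based positions of Asn residues that start an Asn-Xxx-Ser/Thr motif."""
    hits = []
    for i in range(len(residues) - 2):
        first, middle, third = residues[i:i + 3]
        # Pro in the middle position is conventionally excluded (assumption, see lead-in)
        if first == "Asn" and middle != "Pro" and third in ("Ser", "Thr"):
            hits.append(i + 1)
    return hits


# Replace the unidentified residue with the Asn deduced from the cloned cDNA
resolved = ["Asn" if residue == "Xaa" else residue for residue in B_N_TERMINUS]
print(sequon_positions(resolved))  # [6, 7]
```

Position 7 is the residue that gave no signal during sequencing, and it heads an Asn-Ser-Thr motif, consistent with the presumption of glycosylation. Position 6 also matches the motif formally (Asn-Asn-Ser), although that residue was identified directly by Edman degradation.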

A sample of the A monomer was treated with either the protease trypsin, which cleaves polypeptides on
the C-terminal side of lysine and arginine residues, or Lys C, which cleaves polypeptides on the C-terminal side
of lysine, by procedures well known in the art. The peptides are isolated by reversed phase HPLC (RP-HPLC).
The amino acid sequences of the isolated peptides are determined using the Edman degradation in the ABI
gas phase sequenator in conjunction with the ABI 120A on line phenylthiohydantoin analyzer following
manufacturer's instructions. The amino acid sequences are shown in Figure 1.

Reduced and carboxymethylated A monomer is dried and solubilized in about 0.7 M Tris, about pH 7.8,
about 6 M guanidinium chloride containing about 0.1% EDTA. V8 protease is added in 0.1 M ammonium
bicarbonate buffer, about pH 8.0, and the mixture is incubated for about 48 hr at about 37°C. The protease cleaves
predominantly on the carboxyl terminal side of glutamic acid residues. The resulting polypeptides were resolved
by C18 RP-HPLC as above.

The reduced and carboxymethylated A subunit protein solution is adjusted to a pH of about 6.8 with 6 N
HCl and dithiothreitol is added to a final concentration of 2 M for reduction of any methionine sulfoxide to
methionine residues. After about 20 hr of reduction at about 39°C the protein is repurified by C4 HPLC. The
product is dried and cleaved on the carboxyl terminal side of methionine residues by 200 µl of 40 mM cyanogen
bromide in about 70% (v/v) formic acid under an argon atmosphere at about 20°C for about 24 hr in the dark.
The cleavage products are resolved by C18 RP-HPLC. The amino acid sequence is shown in Figure 1, see Conn
et al., Proc. Natl. Acad. Sci. USA 87: 2628-2632 (1990).
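
The cleavage specificities relied on above (trypsin after Lys and Arg, Lys C after Lys, V8 protease after Glu, cyanogen bromide after Met) can be sketched as a simple in silico digest. The example is illustrative only: it applies the rules exactly as stated in the text, ignores incomplete or secondary cleavages, and uses the short N-terminal sequence SEQ ID NO:1 rather than the full subunit. The names are hypothetical.

```python
# Residues after which each reagent cleaves, as described in the text
CLEAVES_AFTER = {
    "trypsin": {"Lys", "Arg"},
    "Lys-C": {"Lys"},
    "V8": {"Glu"},
    "CNBr": {"Met"},
}

A_N_TERMINUS = "Ala Pro Thr Thr Glu Gly Glu Gln Lys Ala His Glu Val Val".split()


def digest(residues, reagent):
    """Split a residue list on the C-terminal side of each target residue."""
    targets = CLEAVES_AFTER[reagent]
    fragments, current = [], []
    for residue in residues:
        current.append(residue)
        if residue in targets:
            fragments.append(current)
            current = []
    if current:  # trailing fragment with no cleavage site at its end
        fragments.append(current)
    return fragments


for reagent in CLEAVES_AFTER:
    pieces = ["-".join(fragment) for fragment in digest(A_N_TERMINUS, reagent)]
    print(f"{reagent}: {pieces}")
```

Because each reagent cuts at different residues, the resulting fragments overlap one another, which is what allows the separately sequenced peptides to be ordered along the subunit.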

The full length 190 amino acid residue protein translation product of the VEGF AB, A monomer or subunit,
which is now known to be identical with the VEGF AA, A monomer, and its cDNA coding sequence are shown
in Figures 2 and 6. The mature amino terminus begins at residue 27, immediately following a typical hydrophobic
secretory leader sequence. A single potential N-glycosylation site exists at Asn100. Most (143 amino acid
residues) of the 164 residues of the reduced and carboxymethylated mature subunit, including the amino terminus
and HPLC reversed phase-purified products of tryptic (T), Lys-C (L), Staphylococcus aureus V8 protease (V8)
and cyanogen bromide (CB) cleavages, were determined by direct microsequencing (Applied Biosystems
470A) using a total of 6 ng of protein. All residues identified by amino acid sequencing are denoted by arrows
pointing to the right either directly beneath the mature processed sequence following the bracket at residue 27
for the amino terminal determination of the whole subunit or, for residues identified from the polypeptide
cleavage products, above the double-headed arrows spanning the length of the particular polypeptide. One listed
pair of polypeptides, V18A and V18B, was sequenced as a mixture and, therefore, is only confirmatory of the
cDNA-deduced amino acid sequence, see Figures 1 and 5.

Samples of the reduced and carboxymethylated pure VEGF AB, A and B monomers, were each digested
with the Lys-C endoproteinase, which cleaves polypeptides on the C-terminal side of lysine residues. The
peptides were isolated by reverse phase HPLC and their amino acid sequences were determined as described
above. The locations of the peptides in the final VEGF AB, A and B sequences are shown in Figure 2 and Figure
3, respectively.

The full length coding region of the A subunit or monomer is determined from three sets of overlapping
cDNA clones. Degenerate oligonucleotide primers based on the amino acid sequences Phe-Met-Asp-Val-Tyr-Gln
from polypeptide L42 (residues 42-47) and Cys-Lys-Asn-Thr-Asp from polypeptide T38 (residues 164-168)
(see Figure 1) were used to PCR amplify the central region of the cDNA for VEGF A chain following the
procedure of Saiki et al., Science 230: 1350-1354 (1985). A single band migrating at 420 bp was gel purified,
digested with SalI, ligated into pGEM3Zf(+) and sequenced. The nucleotide sequence obtained (p4238) was used
to design antisense and sense PCR primers to amplify the 5' and 3' ends of the cDNA according to the protocol
described by Frohman et al., Proc. Natl. Acad. Sci. USA 85: 8998-9002 (1988). These 5' and 3' clones are
denoted p5-15 and pW3, respectively. Regions of complete DNA sequences, excluding the primers, determined
for each set of clones are indicated by double-headed arrows above the nucleotide sequence. In addition to
the cDNA coding the 164 amino acid secreted form identified by protein sequencing, two alternatively spliced
cDNAs encoding 146 amino acid and 214 amino acid forms are cloned and sequenced, Figures 4, 5 and
6.
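
To illustrate why the primers above are described as degenerate, the number of distinct nucleotide sequences able to encode each primer peptide can be counted from the standard genetic code. This is a sketch for orientation only; the actual primer sequences are not given in this section, and the function names are hypothetical.

```python
from collections import Counter
from math import prod

# Standard genetic code, one amino acid letter per codon in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks stop codons.
STANDARD_CODE = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS_PER_RESIDUE = Counter(STANDARD_CODE)

THREE_TO_ONE = {
    "Phe": "F", "Met": "M", "Asp": "D", "Val": "V", "Tyr": "Y",
    "Gln": "Q", "Cys": "C", "Lys": "K", "Asn": "N", "Thr": "T",
}


def primer_degeneracy(peptide):
    """Number of distinct codon combinations encoding a hyphen-separated peptide."""
    return prod(CODONS_PER_RESIDUE[THREE_TO_ONE[res]] for res in peptide.split("-"))


def coding_span_bp(first_residue, last_residue):
    """Length in base pairs of the coding sequence spanning the given residues."""
    return 3 * (last_residue - first_residue + 1)


print(primer_degeneracy("Phe-Met-Asp-Val-Tyr-Gln"))  # 64
print(primer_degeneracy("Cys-Lys-Asn-Thr-Asp"))      # 64
print(coding_span_bp(42, 168))                       # 381
```

Each primer peptide can be encoded by 64 different sequences. Residues 42 to 168 account for 381 bp of coding sequence, whereas the band is reported at 420 bp; this section does not explain the difference, although 5' primer extensions carrying the SalI sites used for cloning would be one possible source (an assumption).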

The full length coding region of the B subunit or monomer is determined from four sets of overlapping cDNA
clones. Degenerate oligonucleotide primers based on the amino acid sequences from polypeptide L50 are used
to PCR amplify the central region of the cDNA for VEGF AB, B monomer, following the procedure of Saiki et
al., Science 230: 1350-1354 (1985). A single band migrating at 108 bp was gel purified, digested with SalI,
ligated into pGEM3Zf(+) and sequenced. The nucleotide sequence obtained (pYG) was used to design
antisense and sense PCR primers to amplify the 5' and 3' ends of the cDNA according to the protocol described
by Frohman et al., Proc. Natl. Acad. Sci. USA 85: 8998-9002 (1988). These 5' and 3' clones are denoted p5V2
and p3V2, respectively. Additional 5' end sequences are determined from clone 202 isolated from a cDNA
library prepared from GS-9L poly A+ RNA. Regions of complete DNA sequences, excluding the primers,
determined for each set of clones are indicated by double-headed arrows above the nucleotide sequence. The entire
base sequence for the 158 amino acid microheterogeneous B subunit and the 138 amino acid
microheterogeneous B subunit are shown in Figures 7 and 8.

The full length coding region of the C subunit or monomer is determined from three sets of overlapping
cDNA clones. Degenerate oligonucleotide primers based on the amino acid sequences Phe Ser Pro Ser Cys
Val and Glu Met Thr Phe Ser Gly from rat VEGF B subunit are used to PCR amplify the central region of the
cDNA of VEGF C chain following the procedure of Saiki et al., Science 230: 1350-1354 (1985). A band migrating
at 180 bp is gel purified, reamplified and digested with SalI, ligated into pGEM3Zf(+) and sequenced. The
nucleotide sequence obtained (pFSEM') is used to design antisense and sense PCR primers to amplify the 5' and
3' ends of the cDNA according to the protocol described by Frohman et al., Proc. Natl. Acad. Sci. USA 85:
8998-9002 (1988). The 5' and 3' clones are denoted p5:16 and p3:19, respectively. The entire base sequence and
amino acid sequence for the C subunit are shown in Figure 9.

It is intended that vascular endothelial cell growth factor of the present invention exist as a heterodimer
consisting of an A microheterogeneous and/or alternatively spliced subunit or a B microheterogeneous and/or
alternatively spliced subunit combined with a C microheterogeneous and/or alternatively spliced subunit. It is
further intended that VEGF homodimer of the present invention exist as two C subunits. The native forms of
the A, B, C subunits may be processed from alternatively spliced full length translation products. The
heterodimers or heterodimeric species can be depicted as: A+B, A+C or B+C with the A, B or C subunits existing in
any of the alternatively spliced or microheterogeneous forms. The homodimers or homodimeric species can
be formed by combinations of any of the alternatively spliced or microheterogeneous forms. It is also intended
that the invention include all of the individual subunit forms of the A subunit, the B subunit and the C subunit
of VEGF.

It is further intended that the nucleotide sequence for vascular endothelial cell growth factor be interpreted
to include all codons that code for the appropriate amino acids in the sequence for each of the vascular
endothelial growth factor subunits, as indicated by the degeneracy of the genetic code. It is further intended that
the nucleotide sequence and the amino acid sequence for VEGF subunits include truncated genes or proteins
that result in proteins which exhibit biological activity similar to vascular endothelial cell growth factor. The
scope of the invention is intended to include all naturally occurring mutations and allelic variants and any
randomly generated artificial mutants which may change the sequences but do not alter biological activity as
determined by the ability to stimulate the division of vascular endothelial cells.

The above described heterodimers, homodimers and subunits of vascular endothelial cell growth factor
are characterized by being the products of chemical synthetic procedures or of procaryotic or eucaryotic host
expression of the DNA sequences as described herein. A monomer is defined as a subunit that is not
incorporated in an oligomeric unit. Expression of the recombinant VEGF genes (recombinant DNA) is accomplished
by a number of different host cells which contain at least one of a number of expression vectors. Expression
vectors are defined herein as DNA sequences that are required for the transcription of cloned copies of
recombinant DNA sequences or genes and the translation of their mRNAs in an appropriate host. Such vectors can
be used to express genes in a variety of hosts such as bacteria, bluegreen algae, yeast cells, insect cells, plant
cells and animal cells, with mammalian cells being preferred. The genes may also be expressed using any of
a number of virus expression systems. Specifically designated vectors allow the shuttling of DNA between
bacteria-yeast, bacteria-plant or bacteria-animal cells. An appropriately constructed expression vector should
contain: an origin of replication for autonomous replication in host cells, selective markers, a limited number of
useful restriction enzyme sites, a high copy number, strong promoters and efficient translational stop signals.
A promoter is defined as a DNA sequence that directs RNA polymerase to bind to DNA and to initiate RNA
synthesis. A strong promoter is one which causes mRNAs to be initiated at high frequency. Expression vectors
may include, but are not limited to, cloning vectors, modified cloning vectors, specifically designed plasmids
or viruses and cosmids. The expression of mammalian genes in cultured mammalian cells is well known in the
art. Sambrook et al., Molecular Cloning, A Laboratory Manual, 2nd Edition, Book 3, Cold Spring Harbor
Laboratory Press (1989) and Current Protocols In Molecular Biology, Ausubel et al. Eds, Greene Publishing
Associates and Wiley-Interscience, 1987 and supplements, disclose various mammalian expression vectors
and vector systems along with methods for the introduction of recombinant vectors into mammalian cells. The
cDNA for the monomeric forms of the A, B and C subunits can be expressed in a system such as that described
by Linemeyer et al., European Patent Application, Publication No. 259,953. The cDNA is incorporated into a
commercially available plasmid such as pKK 223-3 (Pharmacia) as modified by Linemeyer et al. and
expressed in E. coli. Other expression systems and host cells are well known in the art.

The high Cys content and glycosylation sites of the A, B and C subunits along with the structure of the homo-
and heterodimers suggest that expression of biologically active proteins can be carried out in animal cells.
Expression may be carried out in Chinese hamster ovary (CHO) cells with the cloned VEGF DNA cotransfected
with the gene encoding dihydrofolate reductase (dhfr) into dhfr- CHO cells, see Sambrook et al. Transformants
expressing dhfr are selected on media lacking nucleosides and are exposed to increasing concentrations of
methotrexate. The dhfr and VEGF genes are thus coamplified, leading to a stable cell line capable of expressing
high levels of VEGF. The plasmid is designed to encode either an A subunit, a B subunit or a C subunit or a
combination of any two of these subunits. The two cDNAs are operably attached so that the protein produced
will be dimeric and will have VEGF biological activity. Operably attached refers to an appropriate sequential
arrangement of nucleotide segments, cDNA segments or genes such that the desired protein will be produced
by cells containing an expression vector containing the operably attached genes, cDNA segments or
nucleotides. Plasmids containing a single subunit species may be used to cotransfect a suitable cell line.

The expressed proteins (homodimers or heterodimers) are isolated and purified by standard protein
purification processes. It is to be understood that the expression vectors capable of expressing heterodimeric forms
of VEGF will contain two DNA sequences which will encode either an A subunit and/or a DNA sequence which
will encode a B subunit and/or a DNA sequence which will encode a C subunit. Expression vectors capable of
expressing homodimeric forms of VEGF will contain either one or two DNA sequences which encode either
two A, two B or two C subunits.

The ability of the various species of VEGF to stimulate the division of vascular endothelial cells makes this
protein in all microheterogeneous forms and alternative splicing forms useful as a pharmaceutical agent. The
protein as used herein is intended to include all microheterogeneous forms as previously described. The protein
can be used to treat wounds of mammals including humans by the administration of the novel protein to patients
in need of such treatment.

The novel method for the stimulation of vascular endothelial cells comprises treating a sample of the desired
vascular endothelial cells in a nutrient medium with mammalian VEGF, preferably human or rat, at a
concentration of about 1-10 ng/ml. If the vascular endothelial cell growth is conducted in vitro, the process requires
the presence of a nutrient medium such as DMEM or a modification thereof and a low concentration of calf or
bovine serum such as about 0 to 2% by volume. Preservatives such as antibiotics may also be included; these
are well known in the art.

The novel growth factors of this invention are useful for the coverage of artificial blood vessels with vascular
endothelial cells. Vascular endothelial cells from the patient would be obtained by removal of a small segment
of peripheral blood vessel or capillary-containing tissue and the desired cells would be grown in culture in the
presence of VEGF and any other supplemental components that might be required for growth. After growth of
adequate numbers of endothelial cells in culture to cover a synthetic polymeric blood vessel the cells would
be plated on the inside surface of the vessel, such as fixed umbilical vein, which is then implanted in the patient.
Alternatively, tubular supports are coated in vitro with VEGF prior to implantation into a patient. Following
implantation endothelial cells migrate into and grow on the artificial surface. Prior coating of the artificial vessel
either covalently or noncovalently, with proteins such as fibrin, collagen, fibronectin or laminin would be
performed to enhance attachment of the cells to the artificial surface. The cell-lined artificial vessel would then be
surgically implanted into the patient and, being lined with the patient's own cells, would be immunologically
compatible. The non-thrombogenic endothelial cell lining should decrease the incidence of clot formation on the
surface of the artificial vessel and thereby decrease the tendency of vessel blockage or embolism elsewhere.

The novel proteins are also used for the production of artificial vessels. Vascular endothelial cells and
smooth muscle cells from the patient would be obtained and grown separately in culture. The endothelial cells
would be grown in the presence of VEGF as outlined above. The smooth muscle would be grown in culture by
procedures well known in the art. A tubular mesh matrix of a biocompatible polymer (either a synthetic polymer,
with or without a coating of proteins, or a non-immunogenic biopolymeric material such as surgical suture
thread) would be used to support the culture growth of the smooth muscle cells on the exterior side and vascular
endothelial cells on the interior surface. Once the endothelial cells form a confluent monolayer on the inside
surface and multiple layers of smooth muscle cells cover the outside, the vessel is implanted into the patient.

The novel peptides can also be used for the induction of tissue repair or growth. The pure VEGF would be
used to induce and promote growth of tissue by inducing vascular growth and/or repair. The peptide can be
used either topically for tissue repair or intravascularly for vascular repair. For applications involving
neovascularization and healing of surface wounds the formulation would be applied directly at a rate of about 10 ng
to about 1 mg/cm²/day. For vascular repair VEGF is given intravenously at a rate of about 1 ng to about 100
µg/kg/day of body weight. For internal vascular growth, the formulation would be released directly into the region
to be neovascularized either from implanted slow release polymeric material or from slow release pumps or 
repeated injections. The release rate in either case is about 10 ng to about 100 ng/day/cm³.

For non-topical application the VEGF is administered in combination with pharmaceutically acceptable
carriers or diluents such as phosphate buffer, saline, phosphate buffered saline, Ringer's solution, and the
like, in a pharmaceutical composition, according to standard pharmaceutical practice. For topical application,
various pharmaceutical formulations are useful for the administration of the active compound of this invention.
Such formulations include, but are not limited to, the following: ointments such as hydrophilic petrolatum or
polyethylene glycol ointment; pastes which may contain gums such as xanthan gum; solutions such as alcoholic
or aqueous solutions; gels such as aluminum hydroxide or sodium alginate gels; albumins such as human or
animal albumins; collagens such as human or animal collagens; celluloses such as alkyl celluloses, hydroxy
alkyl celluloses and alkylhydroxyalkyl celluloses, for example methylcellulose, hydroxyethyl cellulose,
carboxymethyl cellulose, hydroxypropyl methylcellulose, and hydroxypropyl cellulose; polyoxamers such as
Pluronic® Polyols exemplified by Pluronic® F-127; tetronics such as Tetronic 1508; and alginates such as sodium
alginate.

The following examples illustrate the present invention without, however, limiting the same thereto. 
EXAMPLE 1 

Preparation of Medium Conditioned By GS-9L Cells

GS-9L cells were grown to confluence in 175 cm² tissue culture flasks in Dulbecco's Modified Eagle's
Medium/10% newborn calf serum (DMEM/NCS). At confluence the medium was decanted from the flasks, the
flasks were washed with calcium and magnesium free phosphate buffered saline (PBS) and the cells were
removed by treatment with a 1X solution of trypsin/EDTA (Gibco). The cells (1 x 10^) were pelleted by
centrifugation, resuspended in 1500 ml of DMEM/5% NCS and plated into a ten level (6000 cm² surface area) cell
factory (NUNC). After 72 hours incubation at 37°C in a 5% CO2 atmosphere the medium was decanted and
the cell factories were washed 3 times with PBS. The cells were refed with 1500 ml of a 1:2 mixture of Ham's
F-12/DMEM containing 25 mM Hepes, pH 7.4, 5 µg/ml insulin, 10 µg/ml transferrin and 1.0 mg/ml bovine serum
albumin. This medium was changed with fresh F-12/DMEM after 24 hours and collected every 48 hours after
that. The conditioned medium was filtered through a Whatman #1 paper to remove cell debris and stored frozen
at -20°C.

EXAMPLE 2 

Carboxymethyl-Sephadex Chromatography of VEGF AA and VEGF AB

GS-9L conditioned medium, from Example 1, was thawed and brought to pH 6.0 with 1 M HCl. Two grams
of CM Sephadex C-50 cation exchange (Pharmacia) resin preequilibrated in PBS adjusted to pH 6.0 with 1 N
HCl were added to 20 liters of conditioned medium. The mixture was stirred at low speed for 24 hours at 4°C.
The resin was then allowed to settle and the medium was siphoned off. The remaining resin slurry was packed
into a 3.0 cm diameter column and any remaining medium was allowed to drain off. Unbound protein was
washed off the column with 0.05 M sodium phosphate, pH 6.0, containing 0.15 M NaCl. Vascular endothelial
growth factor activity was eluted from the column with a subsequent wash of 0.05 M sodium phosphate, pH
6.0, containing 0.6 M NaCl.

EXAMPLE 3 

Concanavalin A (Con A) Lectin Affinity Chromatography of VEGF AA and VEGF AB 

A 0.9 cm diameter column containing about 5 ml of packed Con A agarose (Vector Laboratories) was
equilibrated with 0.05 M sodium acetate, pH 6.0, containing 1 mM Ca++, 1 mM Mn++ and 0.6 M NaCl. The active eluate
from the CM Sephadex C-50 column, Example 2, was applied to the Con A agarose and unbound protein was
washed from the column with equilibration buffer. The column was then rinsed with three column volumes of
0.05 M sodium acetate, pH 6.0, containing 1 mM Ca++, 1 mM Mn++ and 0.1 M NaCl. Bound protein was
subsequently eluted from the column by application of this buffer supplemented with 0.32 M α-methyl mannoside
and 0.28 M α-methyl glucoside.

EXAMPLE 4 

Polyaspartic Acid WCX HPLC Cation Exchange Chromatography of VEGF AA and VEGF AB 

The active eluate from the Con A column, Example 3, was applied to a 25 cm x 4.6 mm poly(aspartic acid)
WCX cation exchange HPLC column (Nest Group) pre-equilibrated in 0.05 M sodium phosphate buffer, pH 6.0.
The column was eluted with a linear gradient of 0 to 0.75 M NaCl in this buffer over 60 minutes at a flow rate
of 0.75 ml/min collecting 0.75 ml fractions. Fractions containing VEGF AB activity, eluting between approximately
21.7 and 28.5 ml, were pooled.
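
By way of illustration only, the nominal salt concentration at which the pooled fractions elute can be estimated from the gradient parameters given in this example. The following sketch assumes an ideal linear gradient with no system dead volume or mixing delay, which is an assumption and not part of the example; the function name `nacl_at_volume` is hypothetical.

```python
def nacl_at_volume(volume_ml, flow_ml_min=0.75, duration_min=60.0,
                   start_molar=0.0, end_molar=0.75):
    """Nominal NaCl molarity at a given effluent volume of a linear gradient.

    Assumption: ideal gradient, no dead volume and no mixing delay.
    """
    gradient_volume = flow_ml_min * duration_min  # 45 ml for this example
    fraction = min(max(volume_ml / gradient_volume, 0.0), 1.0)
    return start_molar + fraction * (end_molar - start_molar)


# Nominal NaCl concentrations bracketing the pooled VEGF AB activity
pool_start = nacl_at_volume(21.7)
pool_end = nacl_at_volume(28.5)
print(f"{pool_start:.3f} M to {pool_end:.3f} M NaCl")
```

Under these assumptions the pooled activity corresponds to roughly 0.36 to 0.48 M NaCl; a real system would shift these values by its delay volume.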

EXAMPLE 5 

Metal Chelate Chromatography 

The active fractions eluted from the poly(aspartic acid) WCX column, Example 4, that contain VEGF AB
were pooled, adjusted to pH 7.0 and loaded onto a 1 x 10 cm column of Pharmacia Chelating Sepharose 6B
charged with an excess of copper chloride and equilibrated in 0.05 M sodium phosphate, pH 7.0, containing 2
M NaCl and 0.5 mM imidazole (A buffer). VEGF AB was eluted from the column with a gradient from 0-20% B
over 10 minutes, 20-35% B over 45 minutes and 35-100% B over 5 minutes at a flow rate of 0.3 ml/min, where
B buffer was 0.05 M sodium phosphate, pH 7.0, containing 2 M NaCl and 100 mM imidazole. The active fractions
containing VEGF AB activity eluting between 12.6 and 22.8 ml of the gradient effluent volume were pooled.

EXAMPLE 6 

Reverse Phase Chromatography 

The fractions containing VEGF AB activity pooled from the metal chelate column, Example 5, were loaded
onto a 4.6 mm x 5 cm Vydac C4 reverse phase HPLC column (5 µm particle size) equilibrated in solvent A (0.1%
trifluoroacetic acid (TFA)). The column was eluted with a gradient of 0-30% solvent B over 15 minutes, 30% B
for an additional 15 minutes, then 30-45% B over 22.5 minutes and finally 45-100% B over 5.5 minutes where
solvent B = A containing 67% acetonitrile. The flow rate was maintained at 0.75 ml/min. The active VEGF AB
fractions eluting between approximately 32.2 and 37.5 ml of the gradient effluent volume were pooled.
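
For illustration, the multi-segment gradient of this example can be written as a small table and used to estimate the nominal solvent composition at a given effluent volume. The sketch below is not part of the described procedure; it assumes no dead volume between pump and column outlet, and the names `SEGMENTS`, `percent_b_at_volume` and `acetonitrile_percent` are hypothetical.

```python
# Gradient segments: (duration in minutes, %B at start, %B at end)
SEGMENTS = [
    (15.0, 0.0, 30.0),
    (15.0, 30.0, 30.0),   # isocratic hold at 30% B
    (22.5, 30.0, 45.0),
    (5.5, 45.0, 100.0),
]
FLOW_ML_MIN = 0.75
ACN_IN_B = 67.0  # solvent B is solvent A containing 67% acetonitrile


def percent_b_at_volume(volume_ml):
    """Nominal %B at an effluent volume (assumption: no dead volume)."""
    t = volume_ml / FLOW_ML_MIN
    for duration, b_start, b_end in SEGMENTS:
        if t <= duration:
            return b_start + (b_end - b_start) * t / duration
        t -= duration
    return SEGMENTS[-1][2]  # beyond the programmed gradient


def acetonitrile_percent(volume_ml):
    """Nominal acetonitrile content (percent) at an effluent volume."""
    return percent_b_at_volume(volume_ml) * ACN_IN_B / 100.0


for v in (32.2, 37.5):
    print(v, round(percent_b_at_volume(v), 1), round(acetonitrile_percent(v), 1))
```

On these assumptions the pooled fractions fall within the 30-45% B segment, at roughly 26 to 29% acetonitrile.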

EXAMPLE 7 

Mitogenic Assays

Human umbilical vein endothelial cells (HUVE) were plated on gelatin-coated 48 well tissue culture dishes
at a density of 5000 cells/well in 500 µl of Medium 199 containing 20% heat inactivated fetal calf serum (FCS).
Samples to be assayed were added at the time of plating. The tissue culture plates were incubated at 37°C for
12 hours and 2 microcuries of tritiated thymidine (NEN, 20 Ci/mmol) was added per ml of assay medium (1.0
µCi/well). The plates were incubated for a further 60 hr, the assay medium was removed and the plates were
washed with Hanks balanced salt solution containing 20 mM Hepes, pH 7.5, and 0.5 mg/ml bovine serum
albumin. The cells were lysed and the labelled DNA solubilized with 200 µl of a solution containing 2 gm of sodium
carbonate and 400 mg sodium hydroxide in 100 ml water. The incorporated radioactivity was determined by
liquid scintillation counting.

The concentration of VEGF AB which elicited a half-maximal mitogenic response in HUVE cells was
approximately 2 ± 1 ng/ml. The glycosaminoglycan heparin, which is required in these assays at a level of 10-100
µg/ml to promote a response to a positive control, acidic fibroblast growth factor, does not enhance mitogenic
stimulation of these cells by VEGF AB.

EXAMPLE 8

Purity And Protein Structural Characterization of VEGF AB

Purity of the protein under non-reducing conditions was determined by SDS-PAGE in 12.5% crosslinked
gels according to the method of Laemmli, Nature 227: 680-685 (1970). The silver-stained gel contained a single
band with an apparent mass of approximately 58 kDa. VEGF AB migrated in SDS-PAGE under reducing
conditions in 15% crosslinked gels as a broad silver-stained band with apparent molecular mass of approximately
23 kDa.

VEGF AB was stored at 4°C in the aqueous trifluoroacetic acid (TFA)/acetonitrile mixture used to elute the
homogeneous protein in reversed phase C4 HPLC chromatography at the final stage of the purification protocol
previously described. Aliquots of the purified protein (1-2 µg) were vacuum evaporated to dryness in
acid-washed 10 x 75 mm glass tubes and reduced for 2 hours at 50°C in 100 µl of 0.1 M Tris buffer, pH 9.5, and 6 M
guanidinium chloride containing 0.1% EDTA and 20 mM dithiothreitol (Calbiochem, Ultrol grade) under an argon
atmosphere. The reduced protein was subsequently carboxymethylated for 1 hour at 20°C by the addition of
100 µl of 0.7 M Tris, pH 7.8, containing 0.1% EDTA, 8 M guanidinium chloride, 9.2 µM unlabeled iodoacetic
acid and 50 µCi of iodo[2-14C]acetic acid (17.9 mCi/mmole, Amersham). After completion of the
carboxymethylation, the mixture was loaded directly onto a 4.6 mm x 5.0 cm Vydac C4 column which had been preequilibrated
in 0.1% TFA. The reduced and carboxymethylated protein was repurified by elution with a 45 minute linear
gradient of 0 to 67% (v/v) acetonitrile in 0.1% TFA at a flow rate of 0.75 ml/min and stored in this elution solution
at 4°C. The reduced and carboxymethylated protein eluted as two peaks at approximately 23 and 25 ml that
were of approximately equal area as determined by monitoring absorbance at 210 nm.

Samples of the two protein subunits isolated after reduction and carboxymethylation were each applied to
polybrene-coated glass fiber filters and their N-terminal sequences were determined by Edman degradation in
an ABI gas phase microsequencer in conjunction with an ABI 120A on line phenylthiohydantoin analyzer
following manufacturers instructions. The peak of absorbance eluting at approximately 25 ml (A subunit) yielded
an amino terminal sequence Ala Pro Thr Thr Glu Gly Glu Gln Lys Ala His Glu Val Val SEQ ID NO:1 identical
to VEGF AA. The peak of absorbance eluting at approximately 23 ml (B subunit) yielded the N-terminal
sequence Ala Leu Ser Ala Gly Asn Xaa Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val SEQ ID NO:2 plus a
nearly equal amount of a truncated form of the same sequence missing the first three residues. The missing
X residue corresponds to an Asn in the cloned sequence. Since this missing Asn occurs in a classical
Asn-X-Ser/Thr N-glycosylation sequence it is presumed to be glycosylated. The A and sum of the B chain peptides
were recovered in nearly equal amounts supporting the interpretation that the two peptides combine to form
an AB heterodimer in VEGF II.

Reduced and carboxymethylated A and B subunits (650 ng each) were each dried by vacuum evaporation
in acid-washed 10 x 75 mm glass tubes. Lys C protease (50 ng, Boehringer Mannheim), an enzyme that cleaves
on the carboxyl terminal side of lysine residues, was added to each tube in 100 µl of 25 mM Tris, pH 8.5, 0.1
% EDTA. The substrate protein subunits were separately digested at 37°C for 8 hours and the resulting
polypeptides resolved by reversed phase HPLC chromatography on a 4.6 mm x 25 cm Vydac C18 column equilibrated
in 0.1% TFA. Polypeptides were fractionated by elution with a 2 hour linear gradient of 0-67% acetonitrile in
0.1% TFA at a flow rate of 0.75 ml/min at 20°C. Individual peaks were manually collected and stored in this
elution solution at 4°C.
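
The cleavage specificity described above (carboxyl side of lysine) can be illustrated with a simple in silico digest. This is a sketch under the simplifying assumption of complete cleavage after every lysine; the function name `lys_c_digest` is hypothetical, and the input is only the 14-residue N-terminal sequence of the A subunit (SEQ ID NO:1), not the full subunit.

```python
def lys_c_digest(protein):
    """Split a one-letter protein sequence after every lysine (K).

    Assumption: cleavage is complete at every lysine.
    """
    fragments, current = [], ""
    for residue in protein:
        current += residue
        if residue == "K":
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)
    return fragments


# Ala Pro Thr Thr Glu Gly Glu Gln Lys Ala His Glu Val Val in one-letter code
A_SUBUNIT_NTERM = "APTTEGEQKAHEVV"
fragments = lys_c_digest(A_SUBUNIT_NTERM)
print(fragments)
```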

The amino acid sequences of the isolated peptides were then determined using Edman degradation in an
ABI gas phase sequenator in conjunction with the ABI 120A on line phenylthiohydantoin analyzer (Applied
Biosystems Int.). The peptide sequences are shown in the following Figures 2 and 3. The amino acid sequence
of Lys C fragment L20 (Fig. 5) demonstrates that the form of VEGF AB mature A subunit in the heterodimer is
the 164 amino acid form. The amino acid sequence of Lys C fragment L26 (Fig. 3) demonstrates that the form
of VEGF AB mature B subunit in the heterodimer is the 135 amino acid form derived from the 158 full length
amino acid form.

EXAMPLE 9

Cloning and Sequencing of the VEGF A Monomer

PCR Amplification, Cloning and Sequencing of p4238

Two degenerate oligonucleotides were synthesized in order to amplify the cDNA encoding the peptide
sequences of VEGF A subunit between LysC fragment L42 and tryptic fragment T38. These oligonucleotides
were:

5' TTTGTCGACTT[TC]ATGGA[TC]GT[N]TA[TC]CA 3' SEQ ID NO:3

T383'B
5' CAGAGAATTCGTCGACA[AG]TC[N]GT[AG]TT[TC]TT[AG]CA 3' SEQ ID NO:4

where N=ACGT
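
For illustration, the bracket notation used for the degenerate oligonucleotides can be expanded programmatically to count the individual sequences in the pool. The sketch below is not part of the described method; the function names are hypothetical, and the example uses SEQ ID NO:4 with the bracketed positions read as alternative bases and N as any of A, C, G or T.

```python
import re
from itertools import product


def parse_degenerate(primer):
    """Turn 'AC[TC]G[N]' into a list of allowed-base strings, one per position."""
    positions = []
    for token in re.findall(r"\[[A-Z]+\]|[ACGT]", primer):
        bases = token.strip("[]")
        positions.append("ACGT" if bases == "N" else bases)
    return positions


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for bases in parse_degenerate(primer):
        count *= len(bases)
    return count


def expand(primer):
    """List every individual sequence in the degenerate pool."""
    return ["".join(p) for p in product(*parse_degenerate(primer))]


SEQ_ID_NO_4 = "CAGAGAATTCGTCGACA[AG]TC[N]GT[AG]TT[TC]TT[AG]CA"
pool = expand(SEQ_ID_NO_4)
print(degeneracy(SEQ_ID_NO_4), len(pool[0]))
```

With four two-fold positions and one four-fold position, this primer represents a pool of 64 sequences of 32 nucleotides each.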

Poly A+ RNA was isolated from GS-9L cells using the Fast Track RNA isolation kit from Invitrogen and the
protocol provided. First strand cDNA synthesis was performed as follows:

1 µg of GS-9L RNA was annealed to 1 µg of adapter primer TA17,
5' GACTCGAGTCGACATCGATTTTTTTTTTTTTTTTT 3' SEQ ID NO:5, by incubating in a volume of 10 µl at
70°C for 5 min. followed by cooling to room temperature. To this reaction was added:
3.0 µl water
2.5 µl 10X buffer (500 mM Tris-HCl pH 8.3, 750 mM KCl, 100 mM MgCl2, 5 mM spermidine)
2.5 µl 100 mM DTT
2.5 µl 10 mM each dATP, dGTP, dCTP, dTTP
0.6 µl 15 units RNasin
2.5 µl 40 mM Na pyrophosphate
1.5 µl 15 units reverse transcriptase
and the reaction was incubated at 42°C for 1 hour, then diluted to 1 ml in 10 mM Tris-HCl, 1 mM EDTA,
pH 7.5.
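
For illustration, the final concentrations in the first strand synthesis reaction follow from the listed volumes and stock concentrations by simple dilution. The sketch below assumes the 10 µl annealing mixture contributes volume only and that the values are as read from the list above; the variable names are hypothetical.

```python
ANNEAL_VOLUME_UL = 10.0  # RNA plus adapter primer

# name: (volume added in µl, stock concentration in mM, or None if not stated in mM)
ADDITIONS = {
    "water": (3.0, None),
    "Tris-HCl": (2.5, 500.0),          # from the 10X buffer
    "DTT": (2.5, 100.0),
    "each dNTP": (2.5, 10.0),
    "RNasin": (0.6, None),
    "Na pyrophosphate": (2.5, 40.0),
    "reverse transcriptase": (1.5, None),
}

total_ul = ANNEAL_VOLUME_UL + sum(vol for vol, _ in ADDITIONS.values())

# Final concentration = stock concentration x added volume / total volume
final_mm = {name: stock * vol / total_ul
            for name, (vol, stock) in ADDITIONS.items() if stock is not None}

print(round(total_ul, 1))
for name, conc in final_mm.items():
    print(name, round(conc, 2), "mM")
```

On these figures the reaction volume is about 25 µl, with approximately 50 mM Tris-HCl, 10 mM DTT, 1 mM of each dNTP and 4 mM sodium pyrophosphate.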



PCR Reactions:

Primary reaction (100 µl)
10 µl 10X buffer from Perkin Elmer Cetus GeneAmp kit
16 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
2 µl first strand GS9L cDNA
2 µl 50 pMoles L42.2
2 µl 50 pMoles T383'B
0.5 µl 2.5 units Amplitaq DNA polymerase
67.5 µl water

Reaction conditions, 40 cycles of 94°C, 1'; 50°C, 2'30"; 72°C, 2'.

Prep scale secondary reaction:

100 µl 10X buffer
160 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
10 µl primary PCR reaction
20 µl 500 pMoles L42.2
20 µl 500 pMoles T383'B
5 µl 25 units Amplitaq DNA polymerase
685 µl water

Reaction conditions 94°C, 1'; 55°C, 2'; 72°C, 2'; 30 cycles.
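
For illustration, the component volumes of the primary and prep scale reactions can be checked against their stated totals and compared with one another. The sketch below simply tabulates the volumes listed above; the dictionary keys are shorthand labels chosen here and are hypothetical.

```python
# Volumes in µl as listed for the primary (100 µl) and prep scale reactions
PRIMARY_UL = {
    "10X buffer": 10, "dNTP stock": 16, "template": 2,
    "primer L42.2": 2, "primer T383'B": 2,
    "Amplitaq": 0.5, "water": 67.5,
}
PREP_UL = {
    "10X buffer": 100, "dNTP stock": 160, "template": 10,
    "primer L42.2": 20, "primer T383'B": 20,
    "Amplitaq": 5, "water": 685,
}

primary_total = sum(PRIMARY_UL.values())
prep_total = sum(PREP_UL.values())

# Fold increase of each component between the two reactions
scale = {name: PREP_UL[name] / PRIMARY_UL[name] for name in PRIMARY_UL}

# Final concentration of each dNTP in the primary reaction, in µM
dntp_final_um = 1.25 * 1000 * PRIMARY_UL["dNTP stock"] / primary_total

print(primary_total, prep_total, dntp_final_um)
```

The listed volumes sum to 100 µl and 1000 µl respectively; buffer, nucleotides, primers and enzyme are scaled ten-fold, while the template (here the primary reaction product) is scaled five-fold, and each dNTP is present at 200 µM.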
The PCR product was concentrated by Centricon 30 spin columns, purified on a 1% agarose gel, and
digested with restriction endonuclease Sal I. The Sal I fragment was then ligated into Sal I cut pGEM3Zf(+). The
ligation mix was used to transform E. coli XL-1 blue. Plasmid DNA was isolated from white transformants and
sequenced by the dideoxy chain termination method.

PCR Amplification, Cloning and Sequencing of pW-3

Based on the sequence obtained from the p4238 clones, two specific PCR primers were synthesized; oligo
307 5' TTTGTCGACTCAGAGCGGAGAAAGC 3' SEQ ID NO:6 and oligo 289 5'
TTTGTCGACGAAAATCACTGTGAGC 3' SEQ ID NO:7. These primers were used in combination with oligo A17
5' GACTCGAGTCGACATCG 3' SEQ ID NO:8 to amplify the cDNA encoding the COOH terminus of VEGF A
subunit using the 3' RACE technique described by Frohman et al., PNAS 85: 8998-9002 (1988).
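
For illustration, the primers above each carry a Sal I recognition sequence (GTCGAC), which is what permits the Sal I digestion and ligation used throughout this example. The sketch below locates the site in each primer and shows how the adapter primer TA17 relates to oligo A17; variable names are hypothetical.

```python
SAL_I_SITE = "GTCGAC"  # Sal I recognition sequence

PRIMERS = {
    "oligo 307": "TTTGTCGACTCAGAGCGGAGAAAGC",
    "oligo 289": "TTTGTCGACGAAAATCACTGTGAGC",
    "oligo A17": "GACTCGAGTCGACATCG",
}

# 0-based offset of the Sal I site in each primer (-1 would mean absent)
site_offsets = {name: seq.find(SAL_I_SITE) for name, seq in PRIMERS.items()}

# Adapter primer TA17: the A17 sequence, one further A, then 17 T residues
TA17 = "GACTCGAGTCGACATCGA" + "T" * 17

print(site_offsets)
print(len(TA17), TA17.startswith(PRIMERS["oligo A17"]))
```

Each gene-specific primer carries the site immediately after a three-base 5' extension, and the oligo(dT) adapter TA17 shares its 5' portion with oligo A17, so products primed from TA17 can be reamplified with A17.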



PCR reactions:

Primary reaction 100 µl
10 µl 10X buffer from Perkin Elmer Cetus GeneAmp kit
18 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
0.35 µl first strand GS-9L cDNA
2 µl 50 pMoles oligo 289
0.5 µl 2.5 units Amplitaq DNA polymerase
67.15 µl water

Reaction conditions 94°C, 1'; 58°C, 2'; 72°C, 2'; 10 cycles then add 50 pMoles A17, then 1 cycle of 94°C, 1';
58°C, 2'; 72°C, 40' followed by 40 cycles 94°C, 1'; 58°C, 2'; 72°C, 2'.
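
For illustration, the three-phase cycling program of the primary reaction can be written down as data and its total programmed hold time computed. This sketch counts only the stated hold times and ignores temperature ramping and the pause for adding oligo A17, which are assumptions of the illustration; names are hypothetical.

```python
# (number of cycles, [minutes at 94°C, minutes at 58°C, minutes at 72°C])
PROGRAM = [
    (10, [1, 2, 2]),    # oligo 289 alone
    (1, [1, 2, 40]),    # after addition of A17: single cycle, 40' extension
    (40, [1, 2, 2]),
]


def hold_minutes(program):
    """Total programmed hold time in minutes, excluding ramp times."""
    return sum(cycles * sum(steps) for cycles, steps in program)


total_min = hold_minutes(PROGRAM)
print(total_min, "minutes")
```

The programmed holds add up to 293 minutes, a little under five hours before ramping is taken into account.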



Prep Scale secondary reaction:

60 µl 10X buffer
108 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
24 µl primary PCR reaction
12 µl 300 pMoles oligo 307
12 µl 300 pMoles oligo A17
3 µl 15 units Amplitaq DNA polymerase
381 µl water

Reaction conditions 94°C, 1'; 58°C, 2'; 72°C, 2'; 30 cycles.
The PCR product was purified on a 1% agarose gel and digested with restriction endonuclease Sal I. The
Sal I fragment was then ligated into Sal I cut pGEM3Zf(+). The ligation mix was used to transform E. coli XL-1
blue. Plasmid DNA was isolated from white transformants and sequenced by the dideoxy chain termination
method.



PCR Amplification, Cloning and Sequencing of p5-15

Based on the sequence of p4238 clones, two specific PCR primers were synthesized; oligo 113
5' TTTGTCGACAACACAGGACGGCTTGAAG 3' SEQ ID NO:9 and oligo 74 5'
TTTGTCGACATACTCCTGGAAGATGTCC 3' SEQ ID NO:10.
These primers were used in combination with oligo A17 5' GACTCGAGTCGACATCG 3' SEQ ID NO:8 to amplify
the cDNA encoding the amino terminus of VEGF A subunit using the 5' RACE technique described by Frohman
et al., supra. Oligo 151 was synthesized in order to specifically prime VEGF A subunit cDNA from GS-9L RNA.
Oligo 151 is 5' CTTCATCATTGCAGCAGC 3' SEQ ID NO:11.
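
For illustration, because oligo 151 primes first strand cDNA synthesis it must be complementary to the mRNA, so the corresponding stretch of the coding (sense) strand is its reverse complement. The sketch below computes that sequence; the function name is hypothetical and no claim is made here about where in the cDNA the stretch lies.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


OLIGO_151 = "CTTCATCATTGCAGCAGC"
sense_target = reverse_complement(OLIGO_151)
print(sense_target)
```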
RNA was isolated from GS-9L cells using the Fast Track RNA isolation kit from Invitrogen using the protocol
provided. First strand cDNA synthesis was performed as follows:

One µg of GS9L RNA was annealed to 1 µg of oligo 151 by incubating in a volume of 6 µl at 70°C for
5' followed by cooling to room temperature. To this reaction was added:
1.5 µl 10X buffer (500 mM Tris-HCl, pH 8.3, 750 mM KCl, 100 mM MgCl2, 5 mM spermidine)
2.5 µl 10 mM DTT
2.5 µl 10 mM each dATP, dGTP, dCTP, dTTP
0.6 µl 25 units RNasin
2.5 µl 40 mM Na pyrophosphate
9.5 µl 20 units diluted reverse transcriptase

The reaction was incubated at 42°C for 1 hour. Excess oligo 151 was removed by Centricon 100 spin
columns and the 5' end of the cDNA was tailed by the addition of dATP and terminal transferase. The tailed cDNA
was diluted to a final volume of 150 µl in 10 mM Tris-HCl, 1 mM EDTA, pH 7.5.

PCR Reactions:

Primary reaction (50 µl)
5 µl 10X buffer from Perkin Elmer Cetus GeneAmp Kit
8 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
5 µl first strand GS-9L cDNA primed with oligo 151 and tailed
1 µl 25 pMoles oligo 113
1 µl 25 pMoles oligo A17
1 µl 10 pMoles oligo TA17
0.25 µl 1.25 units Amplitaq DNA polymerase
28.75 µl water

Reaction conditions; 1 cycle 94°C 1'; 50°C 2'; 72°C 40' then 40 cycles of 94°C 1'; 50°C 1'30"; 72°C 2'
Prep scale secondary reaction:

60 µl 10X buffer
96 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
6 µl primary PCR reaction
12 µl 300 pMoles oligo 74
12 µl 300 pMoles oligo A17
3 µl 15 units Amplitaq DNA polymerase
411 µl water

Reaction conditions 94°C, 1'; 55°C, 2'; 72°C, 2'; 30 cycles.
The PCR product was concentrated by Centricon 100 spin columns, and digested with restriction
endonuclease Sal I. The Sal I fragment was then ligated into Sal I cut pGEM3Zf(+). The ligation mix was used to transform
E. coli XL-1 blue. Plasmid DNA was isolated from white transformants and sequenced by the dideoxy chain
termination method. The base sequence is shown in Fig. 5.



Cloning and sequencing of alternative forms of VEGF A cDNA

Based on the sequence obtained from the p5-15 and pW-3 clones, two specific PCR primers were
synthesized; oligo 5'C 5' TTTGTCGACAACCATGAACTTTCTGC 3' SEQ ID NO:12 and oligo 181 5'
TTTGTCGACGGTGAGAGGTCTAGTTC 3' SEQ ID NO:13. These primers were used together to amplify
multiple cDNAs encoding alternative forms of the VEGF A subunit.

Preparative PCR Reaction:

50 µl 10X buffer
80 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
10 µl first strand GS-9L cDNA
10 µl 300 pMoles oligo 5'C
10 µl 300 pMoles oligo 181
2.5 µl 15 units Amplitaq DNA polymerase
337.5 µl water

Reaction conditions 94°C, 1'; 58°C, 2'; 72°C, 3'; 40 cycles.

The PCR product was extracted with phenol/chloroform, concentrated by Centricon 30 spin columns,
precipitated by ethanol, digested with restriction endonuclease Sal I, and ligated into Sal I cut pGEM3Zf(+).
The ligation mix was used to transform E. coli XL-1 blue. Plasmid DNA was isolated from white transformants
and sequenced by the dideoxy chain termination method. Three sets of clones were identified. Clone #12
encoded the 190 amino acid form of VEGF A subunit identical to that shown in Fig. 1. The 164 amino acid secreted
form of VEGF A subunit is that amino acid sequence running continuously from Ala^^ to Arg^^. Clone #14 has
a 135 base pair deletion between the second base of the Asn^^ codon and the third base of the Arg^^ codon.
This clone thus encodes a 146 aa form of the VEGF A subunit with the conversion of Asn^^ to Lys^^. The 120
amino acid secreted form of VEGF A subunit runs from Ala^^ to Asn^^, which becomes Lys^^ and does not
begin until Cys^^, this form also finishes at Arg^^, Figure 4. Clone #16 has a 72 base pair insertion between
the second and third base of the Asn^^ codon. This clone thus encodes the 214 amino acid form of the VEGF
A subunit with the conversion of Asn^^ to Lys^^, Figure 6.

EXAMPLE 10 



Cloning and Sequencing of the VEGF B Subunit 

PCR Amplification, Cloning and Sequencing of pYG

Two degenerate oligonucleotides were synthesized in order to amplify the cDNA encoding the peptide
sequences of VEGF B on Lys C fragment L50. These oligonucleotides were:
YI 5' TTTGTCGACATA[TC]AT[TCA]GC[N]GA[TC]GA[AG]C 3' SEQ ID NO:14
GC 5' TTTGTCGACTC[AG]TC[AG]TT[AG]CA[AG]CA[N]CC 3' SEQ ID NO:15 where N=ACGT

RNA was isolated from GS-9L cells using the Fast Track RNA isolation kit from Invitrogen and the protocol
provided. First strand cDNA synthesis was performed as follows:

1 µg of GS-9L poly A+ RNA was annealed to 1 µg of adapter primer TA17,
5' GACTCGAGTCGACATCGATTTTTTTTTTTTTTTTT 3' SEQ ID NO:5, by incubating in a volume of 10 µl at
70°C for 5 min. followed by cooling to room temperature. To this reaction was added:
3.0 µl water
2.5 µl 10X buffer (500 mM Tris-HCl, pH 8.3, 750 mM KCl, 100 mM MgCl2, 5 mM spermidine)
2.5 µl 100 mM DTT
2.5 µl 10 mM each dATP, dGTP, dCTP, dTTP
0.6 µl 15 units RNasin
2.5 µl 40 mM Na pyrophosphate
1.5 µl 15 units reverse transcriptase
and the reaction was incubated at 42°C for 1 hour, then diluted to 1 ml in 10 mM Tris-HCl, 1 mM EDTA,
pH 7.5.



PCR Reactions: 



Primary reaction (50 µl)
5 µl 10X buffer from Perkin Elmer Cetus GeneAmp kit
8 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
1 µl first strand GS-9L cDNA
1 µl 50 pMoles oligo YI
1 µl 50 pMoles oligo GC
0.25 µl 1.25 units Amplitaq DNA polymerase
33.75 µl water

Reaction conditions, 40 cycles of 94°C, 1'; 50°C, 2'; 72°C, 2'.

Prep scale reaction: 

60 µl 10X buffer
96 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
12 µl first strand GS9L cDNA
12 µl 500 pMoles oligo YI
12 µl 500 pMoles oligo GC
3 µl 15 units Amplitaq DNA polymerase
405 µl water

Reaction conditions 94°C, 1'; 50°C, 2'; 72°C, 2'; 40 cycles.
The PCR product was concentrated by Centricon 30 spin columns and digested with restriction
endonuclease Sal I. The Sal I fragment was then ligated into Sal I cut pGEM3Zf(+). The ligation mix was used to transform
E. coli XL-1 blue. Plasmid DNA was isolated from white transformants and sequenced by the dideoxy chain
termination method.

PCR Amplification, Cloning and Sequencing of p3V2

Based on the sequence obtained from the pYG clones, a specific PCR primer was synthesized; oligo HP
5' TTTGTCGACACACCCTAATGAAGTGTC 3' SEQ ID NO:16.
This primer was used in combination with oligo A17 5'
GACTCGAGTCGACATCG 3' SEQ ID NO:8 to amplify the cDNA encoding the COOH terminus of the VEGF B
subunit using the 3' RACE technique described by Frohman et al., PNAS 85: 8998-9002 (1988).

Preparative PCR reaction: 

60 µl 10X buffer from Perkin Elmer Cetus GeneAmp Kit
12 µl first strand GS9L cDNA
96 µl 1.25 mM each of dATP, dCTP, dGTP, dTTP
12 µl 300 pMoles oligo A17
12 µl 300 pMoles oligo HP
3 µl 15 units Amplitaq DNA polymerase
405 µl water

Reaction conditions 1 cycle of 94°C, 1'; 58°C, 2'; 72°C, 2'; followed by 40 cycles 94°C, 1'; 58°C, 2' and
72°C, 2'.

The PCR product was concentrated by Centricon 30 spin columns, precipitated with ethanol and digested
with restriction endonuclease Sal I. The Sal I fragment was then ligated into Sal I cut pGEM3Zf(+). The ligation
mix was used to transform E. coli XL-1 blue. Plasmid DNA was isolated from white transformants and
sequenced by the dideoxy chain termination method.

PCR Amplification, Cloning and Sequencing of p5V2 

Based on the sequence of pYG clones, two specific PCR primers were synthesized; oligo VL'
5' TTTGTCGACAACAGCGACTCAGAAGG 3' SEQ ID NO:17 and oligo VS' 5'
TTTGTCGACACTGAATATATGAGACAC 3' SEQ ID NO:18. These primers were used in combination with oligo A17
5' GACTCGAGTCGACATCG 3' SEQ ID NO:8 to amplify the cDNA encoding the amino terminus of the VEGF
B subunit using the 5' RACE technique described by Frohman et al., supra. Oligo 151 was synthesized in order
to prime cDNA from GS-9L RNA. Oligo 151 is 5' CTTCATCATTGCAGCAGC 3' SEQ ID NO:11.

Poly A+RNA was isolated from GS9L cells using the Fast Track RNA isolation kit from Invitrogen using the 
protocol provided. First strand cDNA synthesis was performed as follows: 

One µg of GS9L RNA was annealed to 1 µg of oligo 151 by incubating in a volume of 6 µl at 70°C for 5'
followed by cooling to room temperature. To this reaction was added:
1.5 µl 10X buffer (500 mM Tris-HCl, pH 8.3, 750 mM KCl, 100 mM MgCl2, 5 mM spermidine)
2.5 µl 10 mM DTT
2.5 µl 10 mM each dATP, dGTP, dCTP, dTTP
0.6 µl 25 units RNasin
2.5 µl 40 mM Na pyrophosphate
9.5 µl 20 units diluted reverse transcriptase
The reaction was incubated at 42°C for 1 hour.

Excess oligo 151 was removed by Centricon 100 spin columns and the 5' end of the cDNA was tailed by
the addition of dATP and terminal transferase. The tailed cDNA was diluted to a final volume of 150 µl in 10
mM Tris-HCl, 1 mM EDTA, pH 7.5.

PCR Reactions:

Primary reaction (50 µl)
5 µl 10X buffer from Perkin Elmer Cetus GeneAmp Kit
8 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
5 µl first strand GS9L cDNA primed with oligo 151 and tailed
1 µl 25 pMoles oligo VL'
1 µl 25 pMoles oligo A17
1 µl 10 pMoles oligo TA17
0.25 µl 1.25 units Amplitaq DNA polymerase
28.75 µl water

Reaction conditions: 1 cycle 94°C, 1'; 58°C, 2'; 72°C, 40' then 40 cycles of 94°C, 1'; 58°C, 2'; 72°C, 2'.
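
The component volumes of the primary reaction can be checked against its stated 50 µl total, and the final working concentrations follow from simple dilution arithmetic. The sketch below is illustrative only: the dictionary keys and function names are hypothetical labels, and the volumes are taken from the list above.

```python
# Component volumes (µl) of the primary reaction, as listed in the text.
# The key names are illustrative labels, not terms defined by the source.
primary_reaction = {
    "10X buffer": 5.0,
    "dNTP stock (1.25 mM each)": 8.0,
    "first strand cDNA": 5.0,
    "oligo VL'": 1.0,
    "oligo A17": 1.0,
    "oligo TA17": 1.0,
    "Amplitaq DNA polymerase": 0.25,
    "water": 28.75,
}


def total_volume(mix):
    """Sum of all component volumes in µl."""
    return sum(mix.values())


def final_concentration(stock_conc, stock_volume, total):
    """Concentration after diluting stock_volume of stock into total volume."""
    return stock_conc * stock_volume / total


total = total_volume(primary_reaction)
dntp_mM = final_concentration(1.25, primary_reaction["dNTP stock (1.25 mM each)"], total)
buffer_fold = final_concentration(10.0, primary_reaction["10X buffer"], total)

print(f"total volume: {total} µl")
print(f"each dNTP: {dntp_mM * 1000:.0f} µM")
print(f"buffer: {buffer_fold}X")
```

Under these figures the mix sums to the stated 50 µl, the buffer is at 1X, and each dNTP is at roughly 200 µM.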



Prep scale secondary reaction

100 µl 10X buffer
160 µl 1.25 mM each stock of dATP, dCTP, dGTP, and dTTP
10 µl primary PCR reaction
20 µl 500 pMoles oligo VS'
20 µl 300 pMoles oligo A17
5 µl 25 units Amplitaq DNA polymerase
685 µl water

Reaction conditions: 94°C, 1'; 58°C, 2'; 72°C, 2'; 30 cycles.
The PCR product was extracted with phenol/chloroform, concentrated by Centricon 30 spin columns,
precipitated by ethanol, and digested with restriction endonuclease SalI. The SalI fragment was purified on 4%
Nu-Sieve Agarose gel then ligated into SalI cut pGEM3Zf(+). The ligation mix was used to transform E. coli
XL-1 blue. Plasmid DNA was isolated from white transformants and sequenced by the dideoxy chain termination
method.



PCR Amplification, Cloning and Sequencing of pCV2 and pCV2.1

Based on the sequences of the p3V2 and p5GV2 clones, two specific PCR primers were synthesized; oligo
5'CV2.1
5' TTTGTCGAC[N][N]GCAGGTCCTAGCTG 3' SEQ ID NO:19 and oligo 3'CV2
5' TTTGTCGAC[N][N]CTAATAAATAGAGGG 3' SEQ ID NO:20.
These primers were used together to amplify the cDNA encoding the VEGF B subunit.



Preparative PCR Reaction:

40 µl 10X buffer
64 µl 1.25 mM each dATP, dTTP, dGTP, dCTP
8 µl first strand GS-9L cDNA
8 µl 200 pMoles 5'CV2.1
8 µl 200 pMoles 3'CV2
2 µl 10 units Amplitaq DNA polymerase
270 µl water

Reaction conditions: 94°C, 1'; 58°C, 2'; 72°C, 2'; 40 cycles.
The PCR product was extracted with phenol/chloroform, concentrated by Centricon 30 spin columns,
precipitated by ethanol, digested with restriction endonuclease SalI, and ligated into SalI cut pGEM3Zf(+).
The ligation mix was used to transform E. coli XL-1 blue. Plasmid DNA was isolated from white transformants
and sequenced by the dideoxy chain termination method. Two sets of clones were identified; one encoded a
158 amino acid sequence and the other encoded a 138 amino acid sequence, see Figures 7 and 8.

cDNA Cloning of VEGF B Subunit

The DNA and protein sequences for the amino terminus of the signal peptide of VEGF B were determined
from a cDNA clone isolated from a cDNA library constructed from GS-9L polyA+ RNA.
First Strand Synthesis

Anneal 15.6 µl (5 µg) GS-9L polyA+ RNA and 2.5 µl (2.5 µg) oligo dT-XbaI primer by heating to 70°C 5',
slow cool to room temperature. Add the following:
5.5 µl 10X buffer (500 mM Tris-HCl, pH 8.3 (42°C), 750 mM KCl, 100 mM MgCl2, 5 mM spermidine)
5.5 µl 100 mM DTT
5.5 µl 10 mM each dATP, dTTP, dCTP, dGTP
1.4 µl (55 units) RNasin
5.5 µl 40 mM NaPPi
13.5 µl (55 units) reverse transcriptase

Incubate at 42°C 60'.


Second Strand Synthesis:

Assemble reaction mix
50 µl first strand reaction
25 µl 10X buffer (500 mM Tris-HCl, pH 7.2, 850 mM KCl, 30 mM MgCl2, 1 mg/ml BSA, 100 mM (NH4)2SO4)
7.5 µl 100 mM DTT
25 µl 1 mM NAD
6.5 µl (65 units) E. coli DNA Polymerase I
2.5 µl (2.5 units) E. coli DNA Ligase
2.5 µl (2 units) E. coli RNase H
135 µl water

Incubate at 14°C for 2 h and then incubate at 70°C for 10'. Add 1 µl (10 units) T4 DNA Polymerase, incubate
at 37°C for 10', add 25 µl 0.2 M EDTA and extract with phenol/chloroform, then precipitate by the addition of 0.5
volume of 7.5 M ammonium acetate and 3 volumes of ethanol, collect precipitate and resuspend in 20 µl of 10
mM Tris-HCl, pH 7.5, 1 mM EDTA.
cDNA Library Construction

The above cDNA was ligated into EcoRI/XbaI digested LambdaGEM-4 (Promega Biochemicals) after the
addition of EcoRI linkers and digestion with EcoRI and XbaI. A cDNA library was amplified from ~50,000
independent clones.

Isolation of Rat VEGF B cDNA Clone

The above cDNA library was screened by plaque hybridization using pCV2 as a probe. Hybridization
conditions were as follows:

5X SSC (1X SSC is 0.15 M sodium chloride, 0.015 M sodium citrate)
50% Formamide
5X Denhardt's Solution (1% Ficoll, 1% polyvinylpyrrolidone, 1% bovine serum albumin)
0.15 mg/ml salmon sperm DNA
hybridize overnight at 42°C.

Filters were washed 3 times in 2X SSC, 0.1% SDS at room temperature for 5', then 1 time in 1X SSC, 0.1%
SDS at 50°C for 30'. Positive clones were identified by autoradiography.
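
Because SSC strengths are given as multiples of the 1X definition above, the salt molarity of each hybridization and wash solution follows by multiplication. The short sketch below works this out; the function and variable names are hypothetical and introduced only for illustration.

```python
# 1X SSC composition in mol/l, as defined in the text.
SSC_1X_MOLAR = {"sodium chloride": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold):
    """Molar salt concentrations of an n-fold SSC solution."""
    return {salt: round(conc * fold, 4) for salt, conc in SSC_1X_MOLAR.items()}


# 5X for hybridization, 2X and 1X for the washes described above.
for fold in (5, 2, 1):
    print(f"{fold}X SSC:", ssc_molarity(fold))
```

The washes thus step down in salt from the hybridization solution (5X) to 2X and then 1X, with the final wash also at a higher temperature.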

The DNA from phage #202 was digested with restriction endonuclease SpeI and the 1.1 kb band ligated
into XbaI digested pGEM3Zf(+). The ligation mix was used to transform E. coli XL-1 blue. Plasmid DNA was
isolated from white transformants and sequenced by the dideoxy chain termination method. The cDNA
sequence and predicted amino acid sequence of the signal peptide are shown in Figures 7 and 8.

The entire nucleotide and amino acid sequence of the 138 amino acid form is shown in Fig. 7. The secreted
protein starts at Ala24 and continues to Arg138. The entire nucleotide and amino acid sequence of the 158 amino
acid form is shown in Figure 8. The secreted protein starts at Ala24 and continues to Leu158.
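
The Ala24 start of the secreted protein can be cross-checked against the sequence listing: the first residues of the precursor in SEQ ID NO:36 should contain the peptide of SEQ ID NO:2 beginning at position 24. The sketch below performs that comparison, treating Xaa as an undetermined residue that matches anything. The function name is hypothetical, and only the first 45 precursor residues are transcribed.

```python
# First 45 residues of the VEGF B precursor, transcribed from SEQ ID NO:36.
precursor = (
    "Met Leu Ala Met Lys Leu Phe Thr Cys Phe Leu Gln Val Leu Ala "
    "Gly Leu Ala Val His Ser Gln Gly Ala Leu Ser Ala Gly Asn Asn "
    "Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val Trp Gly Arg"
).split()

# Peptide transcribed from SEQ ID NO:2 (Xaa = residue not determined).
peptide = (
    "Ala Leu Ser Ala Gly Asn Xaa Ser Thr Glu Met Glu Val "
    "Val Pro Phe Asn Glu Val"
).split()


def matches_at(protein, fragment, start):
    """True if fragment aligns with protein at 1-based position start."""
    window = protein[start - 1:start - 1 + len(fragment)]
    if len(window) != len(fragment):
        return False
    return all(f == "Xaa" or f == w for f, w in zip(fragment, window))


starts = [pos for pos in range(1, len(precursor) + 1)
          if matches_at(precursor, peptide, pos)]
print("peptide aligns at residue(s):", starts)
print("residue 24:", precursor[23])
```

On these inputs the peptide aligns only at residue 24, which is the alanine named in the text, and the Xaa position corresponds to Asn30 of the precursor.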

EXAMPLE 11

Cloning and sequencing VEGF C Subunit

PCR Amplification, Cloning and Sequencing of pFSEM'

Two degenerate oligonucleotides were synthesized based on the sequence of rat VEGF B monomer in order
to amplify VEGF cDNAs from the human medulloblastoma line TE-671, ATCC HTB (McAllister et al., Int.
J. Cancer 20:208-212 [1977]). These oligonucleotides were:
FS 5' TTTGTCGACA TTC AGT CC(N) TC(N) TG(TC) GT 3' SEQ ID NO:21

EM' 5' TTTGTCGACA CTG AGA GAA (N)GT CAT (CT)TC 3' SEQ ID NO:22

where N = AGCT
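
Each degenerate oligonucleotide above is a mixture of defined sequences, and the size of that mixture is the product of the number of alternatives at each degenerate position. The sketch below computes the degeneracy of FS and EM' using the single-letter forms given in the sequence listing (N = A, C, G or T; Y = C or T). The helper names are hypothetical.

```python
from itertools import product

# IUPAC codes needed for the primers in this example.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ACGT",  # any base
    "Y": "CT",    # pyrimidine, written (TC) or (CT) in the text
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer:
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """Enumerate every defined sequence in the degenerate mixture."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in primer))]


# Sequences as printed for SEQ ID NO:21 and SEQ ID NO:22.
FS = "TTTGTCGACA TTCAGTCCNT CNTGYGT".replace(" ", "")
EM = "TTTGTCGACA CTGAGAGAAN GTCATYTC".replace(" ", "")

print("FS :", len(FS), "bases,", degeneracy(FS), "sequences")
print("EM':", len(EM), "bases,", degeneracy(EM), "sequences")
print(expand(EM)[:2])
```

FS has two N positions and one Y position, and EM' has one of each, so the mixtures contain 32 and 8 sequences respectively.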

Poly A+ RNA was isolated from TE-671 cells using the Fast Track RNA isolation kit from Invitrogen and
the protocol provided. First strand cDNA synthesis was performed as follows using the cDNA Cycle kit from
Invitrogen:

1 µl 1 µg of TE-671 polyA+ RNA
19 µl water
5 µl 100 mM MeHgOH
6.25 µl 0.7 M β-mercaptoethanol
2.5 µl random primer
2.5 µl RNase Inhibitor
10 µl 5X RT buffer
2.5 µl 25 mM dNTPs
1.25 µl reverse transcriptase 12.5 units

The reaction was incubated for 60' at 42°C, then 3' at 95°C, placed on ice, then an additional 1.25 µl reverse
transcriptase was added and the reaction incubated an additional 60' at 42°C.
The above procedure was performed in duplicate and the cDNAs pooled to a final volume of 100 µl.



PCR Reactions:

Primary reaction (100 µl)
10 µl 10X buffer from Perkin Elmer Cetus GeneAmp kit
16 µl 1.25 mM each of dATP, dCTP, dGTP, TTP
10 µl first strand TE-671 cDNA
2 µl 50 pmoles FS primer
2 µl 50 pmoles EM' primer
0.5 µl 2.5 units Amplitaq DNA polymerase
59.5 µl water

Reaction conditions: 40 cycles of 90°C, 1'; 2' ramp to 45°C; 2' at 45°C; 2' at 72°C.
Gel Purification

20 µl of the primary PCR reaction was purified on a 4% NuSieve agarose gel. The 180 base pair band was
excised from the gel, heated to 65°C for 5' and used directly as template for the secondary PCR reaction.
Secondary PCR reaction 200 µl

20 µl 10X buffer from Perkin Elmer Cetus GeneAmp kit
32 µl 1.25 mM each of dATP, dCTP, dGTP, TTP
5 µl melted gel slice
4 µl 100 pmoles FS primer
4 µl 100 pmoles EM' primer
1 µl 5 units Amplitaq DNA polymerase
134 µl water

Reaction conditions: 35 cycles of 94°C, 1'; 50°C, 2'; 72°C, 2'
The PCR product was purified on a Qiagen tip 5 column, then digested with restriction endonuclease SalI.
The SalI fragment was then ligated into SalI cut pGEM3Zf(+), and the ligation mix used to transform E. coli
XL-1 blue. Plasmid DNA was isolated from white transformants and sequenced by the dideoxy chain termination
method.



PCR Amplification, Cloning and Sequencing of p3'.19

Based on the sequence obtained from the pFSEM' clone, a specific PCR primer was synthesized;
oligo LH 5' TTTGTCGACA CTG CAC TGT GTG CCG GTG 3' SEQ ID NO:23. This primer was used in
combination with oligo A17, 5' GACTCGAGTCGACATCG 3' SEQ ID NO:8, to amplify the cDNA encoding the COOH
terminus of the VEGF C subunit using the 3' RACE technique described by Frohman et al., PNAS 85:8998-9002
(1988).

Poly A+ RNA was isolated from TE-671 cells using the Fast Track RNA isolation kit from Invitrogen and
the protocol provided. First strand cDNA synthesis was performed as follows using the cDNA Cycle kit from
Invitrogen and the TA17 adapter primer.
TA17 5' GACTCGAGTCGACATCGATTTTTTTTTTTTTTTTT 3' SEQ ID NO:5



0.8 µl 1 µg of TE-671 polyA+ RNA
20.7 µl water
5 µl 100 mM MeHgOH
6.25 µl 0.7 M β-mercaptoethanol
1.0 µl 0.88 µg primer TA17
2.5 µl RNase Inhibitor
10 µl 5X RT buffer
2.5 µl 25 mM dNTPs
1.25 µl reverse transcriptase 12.5 units

The reaction was incubated for 60' at 42°C, then 3' at 95°C, placed on ice, then an additional 1.25 µl reverse
transcriptase was added and the reaction incubated an additional 60' at 42°C.

3' RACE PCR

20 µl 10X buffer from Perkin Elmer Cetus GeneAmp kit
32 µl 1.25 mM each of dATP, dCTP, dGTP, TTP
20 µl first strand TE-671 cDNA primed with TA17
2 µl 50 pmoles LH primer
2 µl 50 pmoles A17 primer
1.0 µl 5 units Amplitaq DNA polymerase
123 µl water

Reaction conditions: 40 cycles of 94°C, 1'; 2' at 58°C; 3' at 72°C.
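
The programmed hold time of a cycling protocol such as the one above is the per-cycle sum of step durations multiplied by the cycle count. The sketch below computes this for the 3' RACE conditions. It deliberately ignores instrument ramp times between temperatures, which the text does not specify for this reaction, and the names used are hypothetical.

```python
def hold_minutes(steps, cycles):
    """Total programmed hold time in minutes, excluding ramp times."""
    return cycles * sum(minutes for _temp_c, minutes in steps)


# (temperature in °C, hold time in minutes) per cycle, from the text.
race_3prime = [(94, 1), (58, 2), (72, 3)]

total = hold_minutes(race_3prime, cycles=40)
print(f"{total} min of programmed holds ({total / 60:.1f} h)")
```

For 40 cycles this comes to 240 minutes of hold time, so the actual run is somewhat longer once ramping is included.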

The PCR product was purified on a Qiagen tip 5 column, then digested with restriction endonuclease SalI.
The SalI fragment was then ligated into SalI cut pGEM3Zf(+), and the ligation mix used to transform E. coli
XL-1 blue. Plasmid DNA was isolated from white transformants and sequenced by the dideoxy chain termination
method.

PCR Amplification, Cloning and Sequencing of p5'.16

Based on the sequence obtained from the pFSEM' clone, two specific PCR primers were synthesized;
oligo VE' 5' TTTGTCGACA AC ATT GGC CGT CTC CAC C 3' SEQ ID NO:24, and oligo TG' 5' TTTGTCGACA
ATC GCC GCA GCA GCC GGT 3' SEQ ID NO:25. These primers were used in combination with
oligo A17, 5' GACTCGAGTCGACATCG 3' SEQ ID NO:8, and oligo TA17
5' GACTCGAGTCGACATCGATTTTTTTTTTTTTTTTT 3' SEQ ID NO:5 to amplify the cDNA encoding the amino
terminus of the VEGF C subunit using the 5' RACE technique described by Frohman et al., PNAS 85:8998-9002
(1988).

Poly A+ RNA was isolated from TE-671 cells using the Fast Track RNA isolation kit from Invitrogen and
the protocol provided. First strand cDNA synthesis was performed as follows using the cDNA Cycle kit from
Invitrogen and the VE' primer:



1.0 µl 1 µg of TE-671 polyA+ RNA
20.25 µl water
5 µl 100 mM MeHgOH
6.25 µl 0.7 M β-mercaptoethanol
1.0 µl 1.0 µg primer VE'
2.5 µl RNase Inhibitor
10 µl 5X RT buffer
2.5 µl 25 mM dNTPs
0.5 µl AMV reverse transcriptase (Promega) 10 units

The reaction was incubated for 60' at 42°C, then 3' at 95°C, placed on ice, then an additional 1.25 µl reverse
transcriptase was added and the reaction incubated an additional 60' at 42°C. Excess oligo VE' was removed
by a Centricon 100 spin column and the 5' end of the cDNA was tailed by the addition of dATP and terminal
transferase. The tailed cDNA was diluted to a final volume of 200 µl in 10 mM Tris-HCl, 1 mM EDTA, pH 7.5.

5' RACE PCR 5 X 100 µl

10 µl 10X buffer from Perkin Elmer Cetus GeneAmp kit
16 µl 1.25 mM each of dATP, dCTP, dGTP, TTP
10 µl first strand TE-671 cDNA primed with VE'
2 µl 50 pmoles TG' primer
2 µl 50 pmoles A17 primer
2 µl 20 pmoles TA17 primer
0.5 µl 2.5 units Amplitaq DNA polymerase
57.5 µl water

Reaction conditions: 40 cycles of 94°C, 1'; 2' ramp to 58°C; 2' at 58°C; 2' at 72°C.
The PCR product was purified on a Qiagen tip 5 column, then digested with restriction endonuclease SalI.
The SalI fragment was then ligated into SalI cut pGEM3Zf(+), and the ligation mix used to transform E. coli
XL-1 blue. Plasmid DNA was isolated from white transformants and sequenced by the dideoxy chain termination
method. The combined sequences from plasmids pFSEM', p3'.19 and p5'.16 are shown in Figure 9.

PCR Amplification, Cloning and Sequencing of phVC16 and phVC2

Based on the sequences of the p5'.16 and p3'.19 clones, two specific PCR primers were synthesized;
oligo 5' GCVB 5' TTTGTCGAC TGG CTC TGG ACG TCT GAG 3' SEQ ID NO:26 and oligo 3'VC 5' TTTGTCGAC
ACT GAA GAG TGT GAC GG 3' SEQ ID NO:27. These primers were used together to amplify the cDNA
encoding the complete VEGF C subunit.
Poly A+ RNA was isolated from TE-671 cells using the Fast Track RNA isolation kit from Invitrogen and
the protocol provided. First strand cDNA synthesis was performed as follows using the cDNA Cycle kit from
Invitrogen:

0.8 µl 1 µg of TE-671 polyA+ RNA
19.2 µl water
5 µl 100 mM MeHgOH
6.25 µl 0.7 M β-mercaptoethanol
2.5 µl oligo dT primer
2.5 µl RNase Inhibitor
10 µl 5X RT buffer
2.5 µl 25 mM dNTPs
1.25 µl reverse transcriptase 12.5 units

The reaction was incubated for 60' at 42°C, then 3' at 95°C, placed on ice, then an additional 1.25 µl reverse
transcriptase was added and the reaction incubated an additional 60' at 42°C.
PCR Reaction 200 µl

20 µl 10X buffer from Perkin Elmer Cetus GeneAmp kit
32 µl 1.25 mM each of dATP, dCTP, dGTP, TTP
20 µl first strand TE-671 cDNA primed with oligo dT
4 µl 50 pmoles 5' GCVB primer
4 µl 50 pmoles 3'VC primer
1 µl 5 units Amplitaq DNA polymerase
119 µl water

Reaction conditions: 40 cycles of 94°C, 1'; 2' at 50°C; 2' at 72°C.

The PCR product was purified on a Qiagen tip 5 column, then digested with restriction endonuclease SalI.
The SalI fragment was then ligated into SalI cut pGEM3Zf(+), and the ligation mix used to transform E. coli
XL-1 blue. Plasmid DNA was isolated from white transformants and sequenced by the dideoxy chain termination
method. In the sequences of clones phVC16 and phVC2 base 463 (Fig. 9) was changed from a T to a C,
eliminating the translational stop codon following amino acid 154; this results in the addition of 16 amino acids
following amino acid Lys 154. The nucleotide sequence and the deduced amino acid sequence of this addition
is:

CAG AGA CCC ACA GAC TGC CAC CTG TGC GGC GAT GCT GTT
Gln Arg Pro Thr Asp Cys His Leu Cys Gly Asp Ala Val
155 160 165

CCC CGG AGG TAA
Pro Arg Arg
170 SEQ ID NO:29

In addition clone phVC16 contains a 3 base pair deletion (Figure 9, nucleotide residues 73-75) resulting in the
deletion of Gln 25.
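
The 16 residue extension can be reproduced by translating the 51 nucleotide sequence given in SEQ ID NO:28 with the standard genetic code. The sketch below does this and also shows the effect of the T to C change. It assumes, as an inference from the text rather than a stated fact, that the altered base is the first position of codon 155, so that the unmodified codon would read TAG. The function names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# SEQ ID NO:28 as printed in the sequence listing.
extension = (
    "CAG AGA CCC ACA GAC TGC CAC CTG TGC GGC GAT GCT GTT "
    "CCC CGG AGG TAA"
).replace(" ", "")

# Assumed unmodified form: first base T instead of C (codon TAG).
unmodified = "T" + extension[1:]

print(len(extension), "nt ->", translate(extension))
print("unmodified ->", repr(translate(unmodified)))
```

In one-letter code the extension reads QRPTDCHLCGDAVPRR, which is the 16 residues listed in SEQ ID NO:29. With T restored at the first position, translation stops immediately.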
SEQUENCE LISTING

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: Not Applicable

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
Ala Pro Thr Thr Glu Gly Glu Gln Lys Ala His Glu Val

5 10

Val



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: Not Applicable 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
Ala Leu Ser Ala Gly Asn Xaa Ser Thr Glu Met Glu Val 

5 10 
Val Pro Phe Asn Glu Val 
15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TTTGTCGACT TYATGGAYGT NTAYCA 26 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CAGAGAATTC GTCGACARTC NGTRTTYTTR CA 32 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GACTCGAGTC GACATCGATT TTTTTTTTTT TTTTT 35



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TTTGTCGACA ACACAGGACG GCTTGAAG 28 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
TTTGTCGACG AT^TCACTG TGAGC 25 

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GACTCGAGTC GACATCG 17

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
TTTGTCGACA ACACAGGACG GCTTGAAG 28

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10
TTTGTCGACA TACTCCTGGA AGATGTCC 28

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
CTTCATCATT GCAGCAGC 18 

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12
TTTGTCGACA ACCATGAACT TTCTGC 26

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13
TTTGTCGACG GTGAGAGGTC TAGTTC 26

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14
TTTGTCGACA TAYATHGCNG AYGARC 26

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15
TTTGTCGACT CRTCRTTRCA RCANCC 26

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
TTTGTCGACA CACCCTAATG AAGTGTC 27 

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17
TTTGTCGACA ACAGCGACTC AGAAGG 26

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18
TTTGTCGACA CTGAATATAT GAGACAC 27

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19
TTTGTCGACN NGCAGGTCCT AGCTG 25

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20
TTTGTCGACN NCTAATAAAT AGAGGG 26

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21
TTTGTCGACA TTCAGTCCNT CNTGYGT 27

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22
TTTGTCGACA CTGAGAGAAN GTCATYTC 28

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23
TTTGTCGACA CTGCACTGTG TGCCGGTG 28

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24
TTTGTCGACA ACATTGGCCG TCTCCACC 28

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25
TTTGTCGACA ATCGCCGCAG CAGCCGGT 28

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26
TTTGTCGACT GGCTCTGGAC GTCTGAG 27

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27
TTTGTCGACA CTGAAGAGTG TGACGG 26
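
The specific primers used for the VEGF C subunit (SEQ ID NOS:23 to 27) all begin with TTTGTCGAC, which contains the SalI recognition sequence GTCGAC used for the cloning steps described in the examples. The sketch below checks each listed primer for that site and compares its length with the declared LENGTH entry. The helper name is hypothetical, and the sequences are transcribed from the listing above.

```python
SALI_SITE = "GTCGAC"  # SalI recognition sequence

# SEQ ID NO -> (sequence as printed in the listing, declared length)
PRIMERS = {
    23: ("TTTGTCGACA CTGCACTGTG TGCCGGTG", 28),
    24: ("TTTGTCGACA ACATTGGCCG TCTCCACC", 28),
    25: ("TTTGTCGACA ATCGCCGCAG CAGCCGGT", 28),
    26: ("TTTGTCGACT GGCTCTGGAC GTCTGAG", 27),
    27: ("TTTGTCGACA CTGAAGAGTG TGACGG", 26),
}


def check_primer(printed, declared_length):
    """Return (length matches declaration, SalI site present)."""
    seq = printed.replace(" ", "")
    return len(seq) == declared_length, SALI_SITE in seq


for seq_id, (printed, length) in PRIMERS.items():
    length_ok, has_site = check_primer(printed, length)
    print(f"SEQ ID NO:{seq_id}: length ok={length_ok}, SalI site={has_site}")
```

A check of this kind is a convenient way to catch transcription errors in a primer before comparing it with the text of the examples.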


(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:

CAG AGA CCC ACA GAC TGC CAC CTG TGC GGC GAT GCT GTT 39

CCC CGG AGG TAA 51

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Gln Arg Pro Thr Asp Cys His Leu Cys Gly Asp Ala Val
5 10

Pro Arg Arg
15

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

AACC

ATG AAC TTT CTG CTC TCT TGG GTG CAC TGG ACC CTG GCT TTA CTG 49
Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu
5 10 15

CTG TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC ACG ACA 94
Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Thr Thr
20 25 30

GAA GGG GAG CAG AAA GCC CAT GAA GTG GTG AAG TTC ATG GAC GTC 139
Glu Gly Glu Gln Lys Ala His Glu Val Val Lys Phe Met Asp Val
35 40 45

TAC CAG CGC AGC TAT TGC CGT CCG ATT GAG ACC CTG GTG GAC ATC 184
Tyr Gln Arg Ser Tyr Cys Arg Pro Ile Glu Thr Leu Val Asp Ile
50 55 60

TTC CAG GAG TAC CCC GAT GAG ATA GAG TAT ATC TTC AAG CCG TCC 229
Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser
65 70 75

TGT GTG CCC CTA ATG CGG TGT GCG GGC TGC TGC AAT GAT GAA GCC 274
Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala
80 85 90

CTG GAG TGC GTG CCC ACG TCG GAG AGC AAC GTC ACT ATG CAG ATC 319
Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gln Ile
95 100 105

ATG CGG ATC AAA CCT CAC CAA AGC CAG CAC ATA GGA GAG ATG AGC 364
Met Arg Ile Lys Pro His Gln Ser Gln His Ile Gly Glu Met Ser
110 115 120

TTC CTG CAG CAT AGC AGA TGT GAA TGC AGA CCA AAG AAA GAT AGA 409
Phe Leu Gln His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg
125 130 135

ACA AAG CCA GAA AAT CAC TGT GAG CCT TGT TCA GAG CGG AGA AAG 454
Thr Lys Pro Glu Asn His Cys Glu Pro Cys Ser Glu Arg Arg Lys
140 145 150

CAT TTG TTT GTC CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA 499
His Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys
155 160 165

AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA 544
Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu Asn Glu
170 175 180

CGT ACT TGC AGA TGT GAC AAG CCA AGG CGG TGA 577
Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg
185 190

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acids 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu
5 10 15

Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Thr Thr
20 25 30

Glu Gly Glu Gln Lys Ala His Glu Val Val Lys Phe Met Asp Val
35 40 45

Tyr Gln Arg Ser Tyr Cys Arg Pro Ile Glu Thr Leu Val Asp Ile
50 55 60

Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser






EP0 506 477 A1 






65 70 75

Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala
80 85 90

Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gln Ile
95 100 105

Met Arg Ile Lys Pro His Gln Ser Gln His Ile Gly Glu Met Ser
110 115 120

Phe Leu Gln His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg
125 130 135

Thr Lys Pro Glu Asn His Cys Glu Pro Cys Ser Glu Arg Arg Lys
140 145 150

His Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys
155 160 165

Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu Asn Glu
170 175 180

Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg
185 190

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

ACCA 4

ATG AAC TTT CTG CTC TCT TGG GTG CAC TGG ACC CTG GCT TTA CTG 49
Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu
5 10 15

CTG TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC ACG ACA 94
Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Thr Thr
20 25 30

GAA GGG GAG CAG AAA GCC CAT GAA GTG GTG AAG TTC ATG GAC GTC 139
Glu Gly Glu Gln Lys Ala His Glu Val Val Lys Phe Met Asp Val
35 40 45

TAC CAG CGC AGC TAT TGC CGT CCG ATT GAG ACC CTG GTG GAC ATC 184
Tyr Gln Arg Ser Tyr Cys Arg Pro Ile Glu Thr Leu Val Asp Ile
50 55 60

TTC CAG GAG TAC CCC GAT GAG ATA GAG TAT ATC TTC AAG CCG TCC 229
Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser
65 70 75

TGT GTG CCC CTA ATG CGG TGT GCG GGC TGC TGC AAT GAT GAA GCC 274
Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala
80 85 90

CTG GAG TGC GTG CCC ACG TCG GAG AGC AAC GTC ACT ATG CAG ATC 319
Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gln Ile
95 100 105

ATG CGG ATC AAA CCT CAC CAA AGC CAG CAC ATA GGA GAG ATG AGC 364
Met Arg Ile Lys Pro His Gln Ser Gln His Ile Gly Glu Met Ser
110 115 120

TTC CTG CAG CAT AGC AGA TGT GAA TGC AGA CCA AAG AAA GAT AGA 409
Phe Leu Gln His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg
125 130 135

ACA AAG CCA GAA AAA TGT GAC AAG CCA AGG CGG TGA 445
Thr Lys Pro Glu Lys Cys Asp Lys Pro Arg Arg
140 145



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu
5 10 15

Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Thr Thr
20 25 30

Glu Gly Glu Gln Lys Ala His Glu Val Val Lys Phe Met Asp Val
35 40 45

Tyr Gln Arg Ser Tyr Cys Arg Pro Ile Glu Thr Leu Val Asp Ile
50 55 60

Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser
65 70 75

Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala
80 85 90

Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gln Ile
95 100 105

Met Arg Ile Lys Pro His Gln Ser Gln His Ile Gly Glu Met Ser
110 115 120

Phe Leu Gln His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg
125 130 135

Thr Lys Pro Glu Lys Cys Asp Lys Pro Arg Arg
140 145

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 649 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:

AACC

ATG AAC TTT CTG CTC TCT TGG GTG CAC TGG ACC CTG GCT TTA CTG 49
Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu
5 10 15

CTG TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC ACG ACA 94
Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Thr Thr
20 25 30

GAA GGG GAG CAG AAA GCC CAT GAA GTG GTG AAG TTC ATG GAC GTC 139
Glu Gly Glu Gln Lys Ala His Glu Val Val Lys Phe Met Asp Val
35 40 45

TAC CAG CGC AGC TAT TGC CGT CCG ATT GAG ACC CTG GTG GAC ATC 184
Tyr Gln Arg Ser Tyr Cys Arg Pro Ile Glu Thr Leu Val Asp Ile
50 55 60

TTC CAG GAG TAC CCC GAT GAG ATA GAG TAT ATC TTC AAG CCG TCC 229
Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser
65 70 75

TGT GTG CCC CTA ATG CGG TGT GCG GGC TGC TGC AAT GAT GAA GCC 274
Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala
80 85 90

CTG GAG TGC GTG CCC ACG TCG GAG AGC AAC GTC ACT ATG CAG ATC 319
Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gln Ile
95 100 105

ATG CGG ATC AAA CCT CAC CAA AGC CAG CAC ATA GGA GAG ATG AGC 364
Met Arg Ile Lys Pro His Gln Ser Gln His Ile Gly Glu Met Ser
110 115 120

TTC CTG CAG CAT AGC AGA TGT GAA TGC AGA CCA AAG AAA GAT AGA 409
Phe Leu Gln His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg
125 130 135

ACA AAG CCA GAA AAA AAA TCA GTT CGA GGA AAG GGA AAG GGT CAA 454
Thr Lys Pro Glu Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gln
140 145 150

AAA CGA AAG CGC AAG AAA TCC CGG TTT AAA TCC TGG AGC GTT CAC 499
Lys Arg Lys Arg Lys Lys Ser Arg Phe Lys Ser Trp Ser Val His
155 160 165

TGT GAG CCT TGT TCA GAG CGG AGA AAG CAT TTG TTT GTC CAA GAT 544
Cys Glu Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln Asp
170 175 180

CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC 589
Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys
185 190 195

AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC 634
Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp
200 205 210

AAG CCA AGG CGG TGA 649

Lys Pro Arg Arg
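
The declared nucleotide lengths of the VEGF A subunit cDNAs can be reconciled with the lengths of their translation products. The sketch below assumes that the four bases printed ahead of the initiating ATG are untranslated leader and that the terminal stop codon is counted in the nucleotide length. These are readings of the listing's layout rather than statements made in the text, and the function name is hypothetical.

```python
def expected_nucleotides(leader, residues):
    """Leader bases plus one codon per residue plus one stop codon."""
    return leader + 3 * (residues + 1)


# SEQ ID NO of the cDNA -> (leader bases, residues encoded, declared length)
LISTED = {
    30: (4, 190, 577),
    32: (4, 146, 445),
    34: (4, 214, 649),
}

for seq_id, (leader, residues, declared) in LISTED.items():
    computed = expected_nucleotides(leader, residues)
    print(f"SEQ ID NO:{seq_id}: computed {computed}, declared {declared}")
```

All three declared lengths agree with this accounting, which also confirms the running nucleotide counts printed at the end of each row.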

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Asn Phe Leu Leu Ser Trp Val His Trp Thr Leu Ala Leu Leu
5 10 15

Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Thr Thr
20 25 30

Glu Gly Glu Gln Lys Ala His Glu Val Val Lys Phe Met Asp Val
35 40 45

Tyr Gln Arg Ser Tyr Cys Arg Pro Ile Glu Thr Leu Val Asp Ile
50 55 60

Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser
65 70 75

Cys Val Pro Leu Met Arg Cys Ala Gly Cys Cys Asn Asp Glu Ala
80 85 90

Leu Glu Cys Val Pro Thr Ser Glu Ser Asn Val Thr Met Gln Ile
95 100 105

Met Arg Ile Lys Pro His Gln Ser Gln His Ile Gly Glu Met Ser
110 115 120

Phe Leu Gln His Ser Arg Cys Glu Cys Arg Pro Lys Lys Asp Arg
125 130 135

Thr Lys Pro Glu Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gln
140 145 150

Lys Arg Lys Arg Lys Lys Ser Arg Phe Lys Ser Trp Ser Val His
155 160 165

Cys Glu Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln Asp
170 175 180

Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys
185 190 195

Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp
200 205 210

Lys Pro Arg Arg

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

ATG CTG GCC ATG AAG CTG TTC ACT TGC TTC TTG CAG GTC CTA GCT 45
Met Leu Ala Met Lys Leu Phe Thr Cys Phe Leu Gln Val Leu Ala
5 10 15

GGG TTG GCT GTG CAC TCC CAG GGG GCC CTG TCT GCT GGG AAC AAC 90
Gly Leu Ala Val His Ser Gln Gly Ala Leu Ser Ala Gly Asn Asn
20 25 30

TCA ACA GAA ATG GAA GTG GTG CCT TTC AAT GAA GTG TGG GGC CGC 135
Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val Trp Gly Arg
35 40 45

AGC TAC TGC CGG CCA ATG GAG AAG CTG GTG TAC ATT GCA GAT GAA 180
Ser Tyr Cys Arg Pro Met Glu Lys Leu Val Tyr Ile Ala Asp Glu
50 55 60

CAC CCT AAT GAA GTG TCT CAT ATA TTC AGT CCG TCA TGT GTC CTT 225
His Pro Asn Glu Val Ser His Ile Phe Ser Pro Ser Cys Val Leu
65 70 75

CTG AGT CGC TGT AGT GGC TGC TGT GGT GAC GAG GGT CTG CAC TGT 270
Leu Ser Arg Cys Ser Gly Cys Cys Gly Asp Glu Gly Leu His Cys
80 85 90

GTG GCG CTA AAG ACA GCC AAC ATC ACT ATG CAG ATC TTA AAG ATT 315
Val Ala Leu Lys Thr Ala Asn Ile Thr Met Gln Ile Leu Lys Ile
95 100 105

CCC CCC AAT CGG GAT CCA CAT TCC TAC GTG GAG ATG ACA TTC TCT 360
Pro Pro Asn Arg Asp Pro His Ser Tyr Val Glu Met Thr Phe Ser
110 115 120

CAG GAT GTA CTC TGC GAA TGC AGG CCT ATT CTG GAG ACG ACA AAG 405
Gln Asp Val Leu Cys Glu Cys Arg Pro Ile Leu Glu Thr Thr Lys
125 130 135

GCA GAA AGG TAA 417
Ala Glu Arg
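
The pairing of each codon row with the amino acid row printed beneath it can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` and the table construction are illustrative and are not part of the listing. It translates the first and last rows of SEQ ID NO: 36.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence into one-letter residues, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: translation ends here
            break
        protein.append(residue)
    return "".join(protein)


# Rows copied from SEQ ID NO: 36 (nucleotides 1-45 and 406-417).
first_row = "ATG CTG GCC ATG AAG CTG TTC ACT TGC TTC TTG CAG GTC CTA GCT"
last_row = "GCA GAA AGG TAA"

print(translate(first_row))  # MLAMKLFTCFLQVLA
print(translate(last_row))   # AER
```

The one-letter output corresponds to the three-letter rows of the listing (Met Leu Ala Met Lys ... and Ala Glu Arg); the same check can be applied row by row to the other nucleic acid sequences.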

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Met Leu Ala Met Lys Leu Phe Thr Cys Phe Leu Gln Val Leu Ala
5 10 15

Gly Leu Ala Val His Ser Gln Gly Ala Leu Ser Ala Gly Asn Asn
20 25 30

Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val Trp Gly Arg
35 40 45

Ser Tyr Cys Arg Pro Met Glu Lys Leu Val Tyr Ile Ala Asp Glu
50 55 60

His Pro Asn Glu Val Ser His Ile Phe Ser Pro Ser Cys Val Leu
65 70 75

Leu Ser Arg Cys Ser Gly Cys Cys Gly Asp Glu Gly Leu His Cys
80 85 90

Val Ala Leu Lys Thr Ala Asn Ile Thr Met Gln Ile Leu Lys Ile
95 100 105

Pro Pro Asn Arg Asp Pro His Ser Tyr Val Glu Met Thr Phe Ser
110 115 120

Gln Asp Val Leu Cys Glu Cys Arg Pro Ile Leu Glu Thr Thr Lys
125 130 135

Ala Glu Arg

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 477 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

ATG CTG GCC ATG AAG CTG TTC ACT TGC TTC TTG CAG GTC CTA GCT 45
Met Leu Ala Met Lys Leu Phe Thr Cys Phe Leu Gln Val Leu Ala
5 10 15

GGG TTG GCT GTG CAC TCC CAG GGG GCC CTG TCT GCT GGG AAC AAC 90
Gly Leu Ala Val His Ser Gln Gly Ala Leu Ser Ala Gly Asn Asn
20 25 30

TCA ACA GAA ATG GAA GTG GTG CCT TTC AAT GAA GTG TGG GGC CGC 135
Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val Trp Gly Arg
35 40 45

AGC TAC TGC CGG CCA ATG GAG AAG CTG GTG TAC ATT GCA GAT GAA 180
Ser Tyr Cys Arg Pro Met Glu Lys Leu Val Tyr Ile Ala Asp Glu
50 55 60

CAC CCT AAT GAA GTG TCT CAT ATA TTC AGT CCG TCA TGT GTC CTT 225
His Pro Asn Glu Val Ser His Ile Phe Ser Pro Ser Cys Val Leu
65 70 75

CTG AGT CGC TGT AGT GGC TGC TGT GGT GAC GAG GGT CTG CAC TGT 270
Leu Ser Arg Cys Ser Gly Cys Cys Gly Asp Glu Gly Leu His Cys
80 85 90

GTG GCG CTA AAG ACA GCC AAC ATC ACT ATG CAG ATC TTA AAG ATT 315
Val Ala Leu Lys Thr Ala Asn Ile Thr Met Gln Ile Leu Lys Ile
95 100 105

CCC CCC AAT CGG GAT CCA CAT TCC TAC GTG GAG ATG ACA TTC TCT 360
Pro Pro Asn Arg Asp Pro His Ser Tyr Val Glu Met Thr Phe Ser
110 115 120

CAG GAT GTA CTC TGC GAA TGC AGG CCT ATT CTG GAG ACG ACA AAG 405
Gln Asp Val Leu Cys Glu Cys Arg Pro Ile Leu Glu Thr Thr Lys
125 130 135

GCA GAA AGG AGG AAA ACC AAG GGG AAG AGG AAG CAA AGC AAA ACC 450
Ala Glu Arg Arg Lys Thr Lys Gly Lys Arg Lys Gln Ser Lys Thr
140 145 150

CCA CAG ACT GAG GAA CCC CAC CTG TGA 477
Pro Gln Thr Glu Glu Pro His Leu
155

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 158 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

Met Leu Ala Met Lys Leu Phe Thr Cys Phe Leu Gln Val Leu Ala
5 10 15

Gly Leu Ala Val His Ser Gln Gly Ala Leu Ser Ala Gly Asn Asn
20 25 30

Ser Thr Glu Met Glu Val Val Pro Phe Asn Glu Val Trp Gly Arg
35 40 45

Ser Tyr Cys Arg Pro Met Glu Lys Leu Val Tyr Ile Ala Asp Glu
50 55 60

His Pro Asn Glu Val Ser His Ile Phe Ser Pro Ser Cys Val Leu
65 70 75

Leu Ser Arg Cys Ser Gly Cys Cys Gly Asp Glu Gly Leu His Cys
80 85 90

Val Ala Leu Lys Thr Ala Asn Ile Thr Met Gln Ile Leu Lys Ile
95 100 105

Pro Pro Asn Arg Asp Pro His Ser Tyr Val Glu Met Thr Phe Ser
110 115 120

Gln Asp Val Leu Cys Glu Cys Arg Pro Ile Leu Glu Thr Thr Lys
125 130 135

Ala Glu Arg Arg Lys Thr Lys Gly Lys Arg Lys Gln Ser Lys Thr
140 145 150

Pro Gln Thr Glu Glu Pro His Leu
155

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

ATG CCG GTC ATG AGG CTG TTC CCT TGC TTC CTG CAG CTC CTG GCC 45
Met Pro Val Met Arg Leu Phe Pro Cys Phe Leu Gln Leu Leu Ala
5 10 15

GGG CTG GCG CTG CCT GCT GTG CCC CCC CAG CAG TGG GCC TTG TCT 90
Gly Leu Ala Leu Pro Ala Val Pro Pro Gln Gln Trp Ala Leu Ser
20 25 30

GCT GGG AAC GGC TCG TCA GAG GTG GAA GTG GTA CCC TTC CAG GAA 135
Ala Gly Asn Gly Ser Ser Glu Val Glu Val Val Pro Phe Gln Glu
35 40 45

GTG TGG GGC CGC AGC TAC TGC CGG GCG CTG GAG AGG CTG GTG GAC 180
Val Trp Gly Arg Ser Tyr Cys Arg Ala Leu Glu Arg Leu Val Asp
50 55 60

GTC GTG TCC GAG TAC CCC AGC GAG GTG GAG CAC ATG TTC AGC CCA 225
Val Val Ser Glu Tyr Pro Ser Glu Val Glu His Met Phe Ser Pro
65 70 75

TCC TGT GTC TCC CTG CTG CGC TGC ACC GGC TGC TGC GGC GAT GAG 270
Ser Cys Val Ser Leu Leu Arg Cys Thr Gly Cys Cys Gly Asp Glu
80 85 90

AAT CTG CAC TGT GTG CCG GTG GAG ACG GCC AAT GTC ACC ATG CAG 315
Asn Leu His Cys Val Pro Val Glu Thr Ala Asn Val Thr Met Gln
95 100 105

CTC CTA AAG ATC CGT TCT GGG GAC CGG CCC TCC TAC GTG GAG CTG 360
Leu Leu Lys Ile Arg Ser Gly Asp Arg Pro Ser Tyr Val Glu Leu
110 115 120

ACG TTC TCT CAG CAC GTT CGC TGC GAA TGC CGG CCT CTG CGG GAG 405
Thr Phe Ser Gln His Val Arg Cys Glu Cys Arg Pro Leu Arg Glu
125 130 135

AAG ATG AAG CCG GAA AGG AGG AGA CCC AAG GGC AGG GGG AAG AGG 450
Lys Met Lys Pro Glu Arg Arg Arg Pro Lys Gly Arg Gly Lys Arg
140 145 150

AGG AGA GAG AAG TAG 465
Arg Arg Glu Lys



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Met Pro Val Met Arg Leu Phe Pro Cys Phe Leu Gln Leu Leu Ala
5 10 15

Gly Leu Ala Leu Pro Ala Val Pro Pro Gln Gln Trp Ala Leu Ser
20 25 30

Ala Gly Asn Gly Ser Ser Glu Val Glu Val Val Pro Phe Gln Glu
35 40 45

Val Trp Gly Arg Ser Tyr Cys Arg Ala Leu Glu Arg Leu Val Asp
50 55 60

Val Val Ser Glu Tyr Pro Ser Glu Val Glu His Met Phe Ser Pro
65 70 75

Ser Cys Val Ser Leu Leu Arg Cys Thr Gly Cys Cys Gly Asp Glu
80 85 90

Asn Leu His Cys Val Pro Val Glu Thr Ala Asn Val Thr Met Gln
95 100 105

Leu Leu Lys Ile Arg Ser Gly Asp Arg Pro Ser Tyr Val Glu Leu
110 115 120

Thr Phe Ser Gln His Val Arg Cys Glu Cys Arg Pro Leu Arg Glu
125 130 135

Lys Met Lys Pro Glu Arg Arg Arg Pro Lys Gly Arg Gly Lys Arg
140 145 150

Arg Arg Glu Lys
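
The stated lengths of the nucleic acid and amino acid sequences are related by simple arithmetic: each reading frame listed here ends in a stop codon, so a sequence of N base pairs encodes N/3 - 1 residues. A short sketch of that check follows; the function name `residues_encoded` is illustrative only, and the lengths are those given in the SEQUENCE CHARACTERISTICS entries above.

```python
def residues_encoded(base_pairs: int) -> int:
    """Residues encoded by an open reading frame that includes its stop codon."""
    codons, remainder = divmod(base_pairs, 3)
    if remainder:
        raise ValueError("length is not a whole number of codons")
    return codons - 1  # the final codon is the stop signal, not a residue


# (DNA SEQ ID NO, length in base pairs, protein SEQ ID NO, length in residues)
LISTED = [
    (36, 417, 37, 138),
    (38, 477, 39, 158),
    (40, 465, 41, 154),
]

for dna_id, bp, protein_id, residues in LISTED:
    assert residues_encoded(bp) == residues
    print(f"SEQ ID NO: {dna_id} ({bp} bp) -> SEQ ID NO: {protein_id} ({residues} residues)")
```

All three pairs are consistent under this assumption; a mismatch would indicate either a transcription error in a length field or a sequence that carries untranslated nucleotides.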



Claims 

1. A purified and isolated DNA sequence encoding the C subunit of vascular endothelial cell growth factor. 

2. A purified and isolated vascular endothelial cell growth factor C subunit DNA sequence comprising: 



ATG CCG GTC ATG AGG CTG TTC CCT TGC TTC CTG CAG CTC CTG GCC
GGG CTG GCG CTG CCT GCT GTG CCC CCC CAG CAG TGG GCC TTG TCT
GCT GGG AAC GGC TCG TCA GAG GTG GAA GTG GTA CCC TTC CAG GAA
GTG TGG GGC CGC AGC TAC TGC CGG GCG CTG GAG AGG CTG GTG GAC
GTC GTG TCC GAG TAC CCC AGC GAG GTG GAG CAC ATG TTC AGC CCA
TCC TGT GTC TCC CTG CTG CGC TGC ACC GGC TGC TGC GGC GAT GAG
AAT CTG CAC TGT GTG CCG GTG GAG ACG GCC AAT GTC ACC ATG CAG
CTC CTA AAG ATC CGT TCT GGG GAC CGG CCC TCC TAC GTG GAG CTG
ACG TTC TCT CAG CAC GTT CGC TGC GAA TGC CGG CCT CTG CGG GAG
AAG ATG AAG CCG GAA AGG AGG AGA CCC AAG GGC AGG GGG AAG AGG
AGG AGA GAG AAG TAG. SEQ ID NO:40

3. Vascular endothelial cell growth factor AC DNA comprising an A subunit DNA sequence and a C
subunit DNA sequence.

4. Vascular endothelial cell growth factor BC DNA comprising a B subunit DNA sequence and a C
subunit DNA sequence.

5. A purified and isolated vascular endothelial cell growth factor C subunit DNA sequence comprising:

ATG CCG GTC ATG AGG CTG TTC CCT TGC TTC CTG CAG CTC CTG GCC
GGG CTG GCG CTG CCT GCT GTG CCC CCC CAG CAG TGG GCC TTG TCT
GCT GGG AAC GGC TCG TCA GAG GTG GAA GTG GTA CCC TTC CAG GAA
GTG TGG GGC CGC AGC TAC TGC CGG GCG CTG GAG AGG CTG GTG GAC
GTC GTG TCC GAG TAC CCC AGC GAG GTG GAG CAC ATG TTC AGC CCA
TCC TGT GTC TCC CTG CTG CGC TGC ACC GGC TGC TGC GGC GAT GAG
AAT CTG CAC TGT GTG CCG GTG GAG ACG GCC AAT GTC ACC ATG CAG
CTC CTA AAG ATC CGT TCT GGG GAC CGG CCC TCC TAC GTG GAG CTG
ACG TTC TCT CAG CAC GTT CGC TGC GAA TGC CGG CCT CTG CGG GAG
AAG ATG AAG CCG GAA AGG AGG AGA CCC AAG GGC AGG GGG AAG AGG
AGG AGA GAG AAG CAG AGA CCC ACA GAC TGC CAC CTG TGC GGC GAT
GCT GTT CCC CGG AGG TAA. SEQ ID NOS:29 & 40

6. Vascular endothelial cell growth factor AC DNA comprising an A subunit DNA sequence selected from
the group consisting of a DNA sequence encoding a 189 amino acid form, a DNA sequence encoding
a 165 amino acid form and a DNA sequence encoding a 121 amino acid form, with said A subunit DNA
operably attached to a C subunit DNA sequence.

7. Vascular endothelial cell growth factor BC DNA comprising a B subunit DNA sequence selected from the
group consisting of a DNA sequence encoding a 135 amino acid form; and a DNA sequence encoding a
115 amino acid form, with said B subunit DNA sequence operably attached to a C subunit DNA sequence.

8. Homodimeric vascular endothelial growth factor DNA comprising C subunit DNA sequences.

9. A vector containing the DNA sequence of any one of claims 3 to 8.

10. A host cell transformed by the vector of claim 9 containing the DNA sequence encoding vascular
endothelial cell growth factor.

11. A process for the preparation of vascular endothelial cell growth factor comprising culturing the
transformed host cell of claim 10 under conditions suitable for the expression of vascular endothelial cell
growth factor and recovering vascular endothelial cell growth factor.

12. Vascular endothelial growth factor made by the process of claim 11.

13. Vascular endothelial cell growth factor AC comprising an A subunit amino acid sequence and a C subunit
amino acid sequence.

14. Vascular endothelial cell growth factor BC comprising a B subunit amino acid sequence and a C subunit
amino acid sequence.

15. Vascular endothelial cell growth factor CC comprising a C subunit amino acid sequence and a C subunit
amino acid sequence.

16. A purified and isolated vascular endothelial cell growth factor C subunit amino acid sequence comprising:

Met Pro Val Met Arg Leu Phe Pro Cys Phe Leu Gln Leu Leu Ala
Gly Leu Ala Leu Pro Ala Val Pro Pro Gln Gln Trp Ala Leu Ser
Ala Gly Asn Gly Ser Ser Glu Val Glu Val Val Pro Phe Gln Glu
Val Trp Gly Arg Ser Tyr Cys Arg Ala Leu Glu Arg Leu Val Asp
Val Val Ser Glu Tyr Pro Ser Glu Val Glu His Met Phe Ser Pro
Ser Cys Val Ser Leu Leu Arg Cys Thr Gly Cys Cys Gly Asp Glu
Asn Leu His Cys Val Pro Val Glu Thr Ala Asn Val Thr Met Gln
Leu Leu Lys Ile Arg Ser Gly Asp Arg Pro Ser Tyr Val Glu Leu
Thr Phe Ser Gln His Val Arg Cys Glu Cys Arg Pro Leu Arg Glu
Lys Met Lys Pro Glu Arg Arg Arg Pro Lys Gly Arg Gly Lys Arg
Arg Arg Glu Lys. SEQ ID NO:41

17. A purified and isolated vascular endothelial cell growth factor C subunit amino acid sequence comprising:

Met Pro Val Met Arg Leu Phe Pro Cys Phe Leu Gln Leu Leu Ala
Gly Leu Ala Leu Pro Ala Val Pro Pro Gln Gln Trp Ala Leu Ser
Ala Gly Asn Gly Ser Ser Glu Val Glu Val Val Pro Phe Gln Glu
Val Trp Gly Arg Ser Tyr Cys Arg Ala Leu Glu Arg Leu Val Asp
Val Val Ser Glu Tyr Pro Ser Glu Val Glu His Met Phe Ser Pro
Ser Cys Val Ser Leu Leu Arg Cys Thr Gly Cys Cys Gly Asp Glu
Asn Leu His Cys Val Pro Val Glu Thr Ala Asn Val Thr Met Gln
Leu Leu Lys Ile Arg Ser Gly Asp Arg Pro Ser Tyr Val Glu Leu
Thr Phe Ser Gln His Val Arg Cys Glu Cys Arg Pro Leu Arg Glu
Lys Met Lys Pro Glu Arg Arg Arg Pro Lys Gly Arg Gly Lys Arg
Arg Arg Glu Lys Gln Arg Pro Thr Asp Cys His Leu Cys Gly Asp
Ala Val Pro Arg Arg. SEQ ID NOS: 29 & 40

18. A tissue repairing pharmaceutical composition comprising a pharmaceutical carrier and an effective tissue
repairing amount of the purified vascular endothelial growth factor of any one of claims 13 to 15.

19. The use of the vascular endothelial cell growth factor of any one of claims 13 to 15 for the manufacture
of a medicament for promoting tissue repair.

20. The use of the vascular endothelial cell growth factor of any one of claims 13 to 15 for the manufacture
of a medicament for stimulating vascular endothelial cell growth.

FIG. 4 (drawing: cDNA coding sequence, numbered to nucleotide 445, with the full length 146 amino acid residue translation product of the VEGF A subunit)

FIG. 5 (drawing: cDNA coding sequence, numbered to nucleotide 577, with the full length 190 amino acid residue translation product of the VEGF A subunit)

FIG. 6A and FIG. 6B (drawing: cDNA coding sequence, numbered to nucleotide 649, with the full length 214 amino acid residue translation product of the VEGF A subunit; the sequence is that of SEQ ID NOS: 34 and 35)

FIG. 7 (drawing: cDNA coding sequence, numbered to nucleotide 417, with the full length 138 amino acid residue translation product of the VEGF B subunit; the sequence is that of SEQ ID NOS: 36 and 37)

FIG. 8 (drawing: cDNA coding sequence with the full length 158 amino acid residue translation product of the VEGF B subunit; the sequence is that of SEQ ID NOS: 38 and 39)

FIG. 9 (drawing: cDNA coding sequence, numbered to nucleotide 465, with the full length 154 amino acid residue translation product of the VEGF C subunit; the sequence is that of SEQ ID NOS: 40 and 41)
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